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WATER-SOLUBLE LIGAND-BINDING PROTEINS AND ANALOGS OF LIGAND- 
5 GATED ION CHANNELS, CRYSTALS THEREOF AND THEIR USE FOR 

SCREENING LIGANDS OF LIGAND-GATED ION CHANNELS 



10 

SUMMARY OF THE INVENTION 

Novel water-soluble ligand-binding proteins have been identified and isolated, which 
have a ligand-binding profile substantially similar to that of ligand-gated ion channels. 
DNA molecules encoding such proteins have been cloned and characterized. The 

15 biological and structural properties of these proteins are disclosed, as is the amino 
acid and nucleotide sequence. The recombinant DNA molecules, and portions 
thereof, are useful for isolating homologues of the DNA molecules, identifying and 
isolating genomic equivalents of the DNA molecules, and identifying, detecting or 
isolating mutant forms of the DNA molecules. Using a recombinant expression 

20 system functional DNA molecules encoding the water-soluble ligand-binding proteins 
as well as chimeras have been functionally produced. Furthermore, the water-soluble 
ligand-binding proteins could be crystallized revealing the three dimensional (3D) 
structure and enabling the modeling of the 3D structure of the ligand-binding domain 
of ligand-gated ion channels. The invention is further in the field of the development 

25 of new drugs that are capable of selectively intervening in neuronal signaling 
pathways. The invention is more in particular concerned with providing new 
analogues of the channel-coupled receptors, crystal structures thereof and to their 
use in screening ligands for these receptors. 

30 Several documents are cited throughout the text of this specification either by name 
or are referred to by numerals within parenthesis. Full bibliographic citations may be 
found at the end of the specification immediately preceding the claims. Each of the 
documents cited herein (including any manufacturer's specifications, instructions, 
etc.) are hereby incorporated herein by reference; however, there is no admission 

35 that any document cited is indeed prior art as to the present invention. 
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BACKGROUND OF THE INVENTION 

The communication in the central nervous system (CNS) occurs through a complex 
interaction of electrical and chemical signals. Molecules bearing chemical information 
are called neurotransmitters. The chemical information is converted in electric 
5 currents on the post-synaptic membrane, which is specialised in recognising and 
binding neurotransmitters by means of protein receptors. The specific binding of a 
ligand to one type of such receptors, the ionotropic receptors, induces a fast opening 
of the ion channel coupled to the receptor. An important group of ionotropic receptors 
is the superfamily of the channel-coupled receptors, also referred to as ligand-gated 

10 receptors, including the 7-amino-butyric acid (GABA A ) receptor, the glycine receptor, 
the serotonin-3 (5-HT3) receptor and both neuronal and muscle-type nicotinic 
acetylcholine receptors (nAChR). These receptors share certain structural features 
such as (1) a 15-residue cysteine loop between amino acids 128 and 142 
corresponding to the Torpedo AChR a unit, (2) four trans-membrane domains, (3) 

1 5 similar subunit arrangements, and (4) homologies in amino acid sequence. Activation 
of these receptors causes a change in electrical current and hyperpolarisation of the 
cell membrane and consequently an inhibition of the electrical activity of the cell. The 
GABA A receptor and the glycine receptor are coupled to a chloride-selective channel, 
and thus the inhibition of the electrical activity leads to inhibition of the cell response. 

20 On the other hand, activation of the 5-HT3 receptor and the nAChRs provokes an 
excitatory response on the cell because they are connected to a cation-selective 
channel (Na+, K+, Ca2+). The AChRs are the best studied of the ligand-gated 
receptors; for a review, see Arias, Brain Research Reviews, 25 (1997)133-191 and 
Arias, Neurochem. Int. 36 (2000), 595-645). Mutations in these ligand-gated ion- 

25 channels (LGICs) lead to diseases such as congenital myasthenia gravis, epilepsy, 
startle syndrome and alcohol sensitivity (Vafa and Schofield, Int. Rev. Neurobiol. 42, 
285-332; 1998). NAChRs mediate nicotine addiction in chronic tobacco users. Since 
nicotine binding to these receptors also has a positive effect on Alzheimer's disease, 
Parkinson's disease and schizophrenia these receptors present an important drug 

30 target (Paterson and Nordberg, A. Neuronal nicotinic receptors in the human brain. 
Prog. Neurobiol. 61, 75-1 1 1 ; 2000). 

The development of new active compounds that can selectively or - as the case may 
be - a-selectively bind to the channel-coupled receptors, is of utmost importance for 
the understanding of the processes occurring in the nervous system and for the 
35 treatment of disturbances of neural conditions. The development of such active 
compounds requires the availability of a reliable model system for the corresponding 
receptors. The primary structural features (amino acid sequences) of the various 
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receptors have been largely elucidated by now. Certain subunits of the AChRs have 
been found to be determinant in the pharmacological specificrty or affinity of the 
receptor for its ligand (Corringer et al., J. Neuroscience 18 (1998), 648-657). 
However, the study of the ligand binding properties of the receptor proteins is 
5 hampered by the fact that the spatial structure of the proteins - which is decisive in 
the binding of ligands - is still unknown. This is partly because crystallisation of the 
receptor proteins has been unsuccessful up to now. 

The above-defined technical problem is solved by the present invention by providing 
the embodiments characterized in the claims. 
1 0 Accordingly, in one aspect the present invention relates to a water-soluble protein 
derived from a mollusc being capable of binding a ligand of a ligand-gated receptor. 

It has been found according to the invention that acetylcholine-binding proteins 
(AChBP) of certain molluscs show a surprising structural similarity with the channel- 

15 coupled receptors on the one hand and have interesting physical properties, such as 
water-solubility, on the other hand. The molluscan AChBPs are capable of forming 
multimers, especially pentamers, and of binding specific toxins such as cc- 
bungarotoxin. These multimers may be homogeneous (identical units) or 
heterogeneous (different units). These properties make them eminently suitable as 

20 model systems for studying the binding of candidate ligands to the channel-coupled 
receptors. It has been possible to produce these molluscan AChBPs in recombinant 
systems, thus allowing convenient and large-scale production thereof. Moreover, it is 
feasible to construct hybrid proteins sharing the physical properties of the mollusc 
AChBP with the pharmacological properties of the (human) channel-coupled 

25 receptors, thus providing new dedicated tools for screening ligands for these 
receptors. 

The AChBP is a naturally occurring analogue of the extracellular domains of the a- 
subunits of the neuronal nicotinic acetylcholine receptors (nAChRs). In contrast to the 

30 nAChRs, it lacks domains to form a transmembrane ion channel, but alike the 
nAChRs it assembles into a homo-pentamer (Figure 6). Moreover, AChBP has 
ligand-binding characteristics that are typical for a nicotinic receptor. The 3- 
dlmensional structure of AChBP was solved by X-ray crystallography at 2.7A 
resolution (current Rfector = 27.9 %, R tre e = 30.0 %). In crystals, as in solution, AChBP 

35 forms a stable homo-pentamer with dimensions comparable to those of the ligand- 
binding domain of the nAChR, as determined in EM studies by Unwin, Struct. Struct. 
Biol 121 (1998), 181-190. The high-resolution crystal structure of AChBP, along with 
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biochemical and pharmacological data, supports the extrapolation of AChBP as a 
good mimic of ligand-binding domains of ligand-gated ion channels including nAChR, 
5-HT3R, GABAa, c R and GlyR. 

Four AChBPs according to the present invention are exemplified herein, isolated and 

5 cloned from the CNS of Lymnaea stagnalis (L-AChBP_T1 and L-AChBPJT2) and 
Bulinus truncatus (B-AChBP_T1 and B-AChBP_T2). L-AChBP_T1 and 2 are 229 
amino acid proteins with a signal sequence of 19 amino acids (224 and 21 amino 
acids, respectively, for B-AChBP_T1 and 2; see also Figure 1) and have sequence 
homology with the extracellular domains of the subunits of ligand-gated ion channels 

10 (Figure 3), in particular with those of the nAChRs (Figure 4 and 5). The mass of the 
purified AChBP from Lymnaea has been determined by mass-spectrometry. The 
glycosylated form has a mass of about 24720 Da and the de-glycosylated form of 
about 23832 Da. In SDS-PAGE the glycosylated AChBP migrates between the 14 
and 26 kDA marker proteins. Hydrophopicity plots of the AChBPs are shown in 

15 Figure 2, which reveal those regions of the ligand-binding proteins that are 
particularly hydrophilic and thus may be replaced at least in part or essential amino 
acids thereof in the ligand-binding domain of the ligand-gated ion channel. Sequence 
conservation is particularly high in the so-called loop areas (reviewed by Arias, 
Neurochem. Int. 36 (2000), 595-645), which contain the residues involved in ligand- 

20 binding. The cysteine residues characteristic for the Cys-loop family of ligand gated 
receptors are conserved in AChBP. Also the double cysteine typically found in the 
alpha subunits of the nAChR is present AChBP protein sequence ends at the 
position where in the nAChRs the first predicted transmembrane domain would start. 
The ligand-binding characteristics of AChPBs are described in Example 4 and 

25 summerized in Table 2. 

The terms "channel coupled receptors", "ligand-gated receptor", "ligand-gated ion 
channel" are used interchangeable herein. However, in context with the natural 
occurring, in particular human molecules the term "ligand-gated ion channel" is 

30 preferably used. The water-soluble ligand-binding protein of the invention can also be 
characterized as a ligand-binding protein having at least 10%, more preferably at 
least 12%, still more preferably at least 15% and most preferably at least 20% amino 
acid sequence identity to a vertebrate ligand-gated ion channel but missing any 
trans-membrane domain. A ligand-gated receptor of the present invention is 

35 characterized by having substantially the same ligand-binding characteristics of a 
vertebrate, preferably mammalian, most preferably human ligand-gated ion channel 
but comprising at least one alteration 'in the original amino acid sequence, said 
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alteration resulting in the presence of an amino acid determining or contributing to 
the water-solubility of the water-soluble ligand-protein found in molluscs, in particular 
snails such as those described in more detail below. 

5 The terms "ligand-binding protein", Tigand-binding domain" and 'Tigand-binding 
receptor" are meant to at least include the portion of a water-soluble ligand-binding 
protein or corresponding modified ligand-gated ion channel required for binding a 
ligand. Minimally the ligand-binding domain consists of a peptide containing that 
domain. However the use of this term is meant to include a ligand- binding domain or 
10 protein that is comprised by a larger portion of, for example, ligand-gated ion 
channel, such as a fully reconstituted nicotinic acetylcholine receptor. 

As shown in Figure 3 the nicotinic acetylcholine receptor (nAChR) belongs to a well- 
understood member of the ligand-gated ion channels superfamily. The members of 

15 this signaling protein group, including 5-HT3, glycine, GABA A , and GABAc receptors, 
are thought to share common secondary, tertiary, and quaternary structures on the 
basis of a high degree of sequence similarity. Therefore, it is expected that the novel 
findings in respect to the exemplified AChBP equally apply to the other members of 
the mentioned ligand-gated ion channels superfamily. Thus, either water-soluble 

20 protein being capable of binding a ligand of any of those ligand-gated ion channels 
may be found in molluscs or the present 5-HT3, GABA A , and glycine receptors can 
be modified such as to substantially retain their binding affinity. 
Accordingly, the ligand of the water-soluble ligand-binding protein is preferably 
acetylcholine, gamma-amino-butyric acid (GABA), glycine, nicotine or serotonin. 

25 Isolation of such water-soluble ligand-binding proteins can be done as described in 
Example 1 for the AChBP of the present invention. Instead of a-bungarotoxin other 
known ligands or can be used for affinity purification. Most preferably, water-soluble 
ligand-binding protein of the invention is a acetylcholine-binding protein (AChBP). 
Preferably, the ligand-binding protein displays substantially the binding 

30 characteristics shown in Table 2. 

The acetylcholine-binding proteins to be used according to the invention are originally 
derived from aquatic molluscan species, especially species from the class of the 
snails (Gastropoda), in particular from the order of the lunged snails (Pulmonata). 
The order of the Pulmonata is divided into the suborders of the Basommatophora 

35 (mostly aquatic snails), Systellommatophora and Stylommatophora (mostly land 
snails). The Basommatophora include the families of the Acroloxidae (e.g. genus 
. Acroloxus), Lymnaeidae (e.g. genera Galba, Stagnicola, Radix and Lymnaea), 
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Physidae (e.g. genera Physa and Aplexa) and Planorbidae (e.g. genera Planorbis, 
Anisus, Ancylus, Gyraulus, Biomphalaria and Bulinus). Examples of suitable species 
are Lymnaea stagnalis (pond snail) and Bulinus truncates. The isolation of the 
AChBPs from these snails, cloning of the cDNA encoding these AChBPs and th£ir 

5 characterization including the full amino acid sequences is described in the 
examples. The cDNA and amino acid sequences of the AChBPs of Lymnaea 
stagnalis are depicted in SEQ ID Nos. 1 and 2 (L-AChBP_T1) and SEQ ID Nos. 3 
and 4 (L-AChBP_T2). Those of Bulinus truncatus are depicted in SEQ ID Nos. 5 and 
6 (B-AChBP_T1) and SEQ ID Nos. 7 and 8 (B-AChBP_T2). Features of these 

1 0 proteins are further described in the examples and the accompanying figures. 

While a water-soluble ligand-binding protein derived form a Pulmonata species, 
preferably from a Basommatophora species is preferred, it will be appreciated that 
the present invention generally relates to any water-soluble protein being capable of 
binding a ligand of a ligand-gated receptor comprising an amino acid sequence 
selected from the group consisting of: 

(a) an amino acid sequence as depicted in any one of SEQ ID Nos. 2, 4, 
6 or 8 or a functional equivalent thereof, or a fragment of at least 5 
continuous amino acids thereof; 

(b) an amino acid sequence having at least 30% amino acid identity to the 
amino acid sequence of any one of SEQ ID Nos. 2, 4, 6 or 8; and 

(c) an amino acid sequence resulting in a protein which is detectable by a 
monoclonal or polyclonal antibody which recognises, preferably with a binding 
affinity of at least 10' 7 M, a protein comprising an amino acid sequence of (a) 
or(b). 

Identity or similarity, as known in the art, are relationships between two or more 
polypeptide sequences or two or more polynucleotide sequences, as determined by 
comparing the sequences. In the art, identity also means the degree of sequence 
30 relatedness between polypeptide or polynucleotide sequences, as the case may be, 
as determined by the match between strings of such sequences. Both identity and 
similarity can be readily calculated (Computational Molecular Biology, Lesk, ed., 
Oxford University Press, New York, 1988; Biocomputing: Informatics and Genome 
Projects, Smith, ed., Academic Press, New York, 1993; Computer Analysis of 
35 Sequence Data, Part I, Griffin and Griffin, eds., Humana Press, New Jersey, 1994; 
Sequence Analysis in Molecular Biology, von Heinje, Academic Press, 1987; and 
Sequence Analysis Primer, Gribskov and Devereux, eds., M Stockton Press, New 
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York, 1991). While there exist a number of methods to measure identity and similarity 
between two polynucleotide or two polypeptide sequences, both terms are well 
known to skilled artisans (von Heinje, supra; Gribskov and Devereux, supra; and 
Carillo and Lipman SIAM J. Applied Math. 48 (1988), 1073). Methods commonly 

5 employed to determine identity or similarity between sequences include, but are not 
limited to those disclosed in Carillo and Lipman; see supra. Preferred methods to 
determine identity are designed to give the largest match between the sequences 
tested. Methods to determine identity and similarity are codified in computer 
programs. Preferred computer program methods to determine identity and similarity 

10 between two sequences include, but are not limited to, GCG program package 
(Devereux et al. ( Nucleic Acids Research 12 (1984), 387), BLASTP, BLASTN, psi 
BLAST and FASTA (Atschul et al., J. Molec. Biol. 215 (1990), 403). 

In another embodiment, the present invention relates to a water-soluble protein being 
1 5 capable of binding a ligand of a ligand-gated receptor comprising 

(a) . at least the amino acids of the water-soluble protein described above 

determining solubility of said protein, in the same or 
corresponding positions as in said protein; and 

(b) at least 4 amino acids determining binding to said ligand. 

20 

Protein expression studies have shown that wild-type AChBP of the mollusc 
Lymnaea stagnalis can be produced in Pichia pastoris yeast. The yeast cells express 
AChBP in a homopentameric form and secrete the protein complex into the medium. 
The large amounts of AChBP per volume of medium produced (up to 2 mg per liter 

25 medium) and the large volumes of yeast that can be cultured allow a large-scale 
production of AChBP. Besides the wild-type AChBP, various AChBP mutants have 
been produced in Pichia pastoris. These include mutants containing the following 
single point mutations (the numbers refer to the amino acid position in the AChBP 
sequence of Lymnaea stagnalis depicted in SEQ ID No. 2 counted from the first 

30 amino acid of the signal peptide; the letter before the number indicates the original 
amino acid and the letter after the number indicates the mutant amino acid) N85D, 
H164Y, D194N, Y204P, Y21 1P and D213N. 

Thus the invention pertains to water-soluble proteins derived from molluscan, 
preferably acetylcholine binding proteins (AChBP's), which are capable of forming 
35 multimers, and are capable of binding a ligand of a ligand-gated receptor. These 
proteins comprise, on the one hand, at least of the amino acids of the AChBP 
determining solubility of the AChBP in the same positions as in the AChBP, and, on 
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the other hand, amino acids determining binding to the ligand of the ligand-gated 
receptor. The degree of identity with the molluscan AChBP sequence can be defined 
by amino acid identity, of at least 15%, preferably 20%, more preferably 30%, still 
more preferably 40%, preferably at least 50 or even at least 60%, preferably more 
5 than 70%, more preferably more than 80% and most preferably at least 90% identity, 
or more, as determined, e.g., using the art-known BLAST algorithm. The amino acids 
determining binding to the ligand should comprise at least 4 amino acids, preferably 
at least 6 or even at least 8 amino acids, including a series of at least 3 or 4 amino 
acids, corresponding to the receptor sequence and preferably differing from the 

1 0 corresponding AChBP amino acids. Preferred embodiments of these proteins are 
further defined below. Usually, the water-soluble ligand-binding protein or domain as 
part of a for example chimeric ligand-gated ion channel will comprise 200-240 amino 
acids. The ligand is preferably acetylcholine, nicotine, lophotoxin, d-tubocurarine, 
carbamylcholine, galanthamine or epibatidine. 

15 Said ligand-gated receptor can be derived from an arthropod (preferably insect), a 
plant (preferably a higher plant, most preferably a seed plant) or a chordate 
(preferably a mammalian, most preferably human), preferably said ligand-gated 
receptor is a nicotinic acetylcholine receptor. 

20 Usually, the said amino acids in the water-soluble ligand-binding proteins of the 
invention, which determine solubility are in the same positions as in the AChBP 
having the amino acid sequence as depicted in any one of SEQ ID Nos. 2, 4, 6 or 8. 
The solubility determining regions are based on solvent accessibility in structure. The 
respective amino acid residues can be chosen for example according to Figure 10 or 

25 11 in which the solvent accessible regions are indicated. Preferably, the water- 
soluble ligand-binding protein of the invention comprises an amino acid sequence 
having at least 40% amino acid identity to the amino acid sequence of the mature 
AChBP comprising the amino acid sequence of any one of SEQ ID Nos. 2, 4, 6 or 8, 
in which the ligand binding amino acids have been replaced with the corresponding 

30 amino acids of a ligand-gated receptor. 

In one embodiment of the protein of the invention said solubility-determining amino 
acids (a) comprise hydrophilic amino acids (Asp, Glu, Arg, Lys) from the sequences 
20-44, 73-81, 86-92, 112-120, 135-152, 166-189, 196-20, 209-213, and/or 219-227 
of SEQ ID No. 2. 

35 The amino acid sequences of L-AChBP_T1 (SEQ ID No. 2) and T2 (SEQ ID No. 4) 
are almost similar. For the sake of clarity, reference is always made to L-AChBP_T1 
(SEQ ID No. 2). However, all references to amino acid residues within are valid for 
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both T1 and T2, with the noticeable exceptions of Arg(167) becoming Gly(167) and 
and Thr(203) becoming lle(203). Furthermore, regarding the amino acid residues 
(domains) from L-AChBP_T1 and the corresponding residues from B-AChBP the 
following list provide those amino acid positions in which L-AChBP and B-AChBP 

5 differ. All amino acid residue numbers below correspond to their position within the 
amino acid sequence of the immature protein (numbering starting at methionine (1). 
One could also start numbering at the start of the amino acid sequence of the mature 
sequence (L(1)DRAD for L-AChBP and Q(1)IRW for B-AChBP). When using this 
second method (1st amino acid of the mature seq. = position 1) simply subtract 19 

10 from the L-AChBP position numbers and 21 from the B-AChBP position numbers, for 
example Asp(36) becomes Asp(17) for L-AChBP) and Asp(15) for B-AChBP. For the 
further embodiments the positions are given for L-AChBP T1 (SEQ ID No. 2) followed 
by an indication of the corresponding amino acid positions in the amino acid 
sequence of L-AChBP_T2 (SEQ ID No. 4) .and B-AChBP_T1 (SEQ ID No. 6) & B- 

15 AChBP_T2 (SEQ ID No. 8) in the foim of (L-AChBP_T1&T2 : B-AChBP_T1&T2). 

In a preferred embodiment said solubility determining amino acids (a) comprise 
amino acids Asp(36), Asp(68), Glu(115), Arg(137), Asp(143), Asp(148), Glu(150), 
Arg(167), Arg(189), Glu(215) of SEQ ID No.2, wherein Asp may be exchanged for 
20 Glu and vice versa and Lys may be exchanged for Arg and vice versa (L- 
AChBP_T1&T2 : B-AChBP_T1&T2; Asp(36) : Asp(36); Asp(68): Asp(68); Glu(115): 
Glu(116); Arg(137): Arg(138); Asp(143): Asp(144); Asp(148): Asp(149); Glu(150): 
Glu(151); Arg(167): Gly(167), in L-AChBP_T2 : Lys(168); Arg(189): Lys(190); 
Glu(215):Glu(216). 

25 

In a still more preferred embodiment the water-soluble ligand-binding protein 
comprises the amino acids Cys(142), Thr(149), Ala(153), Thr(154), Cys(155), 
Arg(156), lle(157) and/or Lys(158) of SEQ ID No. 2. (L-AChBP_T1&T2 : B- 
AChBP_T1&T2; Cys(142): Cys(143); Thr(149): Thr(150); Ala(153): Ala(154); 

30 Thr(154): Thr(155); Cys(155): Cys(156); Arg(156): Arg(157); lle(157): lle(158); 
Lys(158): Lys(159). In a further embodiment the water-soluble ligand-binding protein 
comprises either in addition or alternatively the amino acids (b) Pro(39), Trp(77), 
Trp(101), Pro(103), Asp(194), and/or Ser(161) of SEQ ID No. 2 (L-AChBP_T1 &T2 : 
B-AChBP_T1&T2; Pro(39): Pro(39); Trp(77): Trp(77); Trp(101): Trp(102); Pro(103): 

35 Pro(1 04); Ser(1 61 ): Ser(1 62); Asp(1 94): Ser(1 95). 

In a still further embodiment the water-soluble ligand-binding protein comprises either 
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in addition or alternatively to the above described embodiments amino acid 
sequences 165-169 and/or 200-203 of SEQ ID No. 2 have been exchanged with the 
corresponding sequence of the ligand-gated receptor (L-AChBP_T1&T2 : B- 
AChBP_T1&T2; His(165)-lso(169):. Asp(166)-Phe(170) (B-AChBP_T1) : Asp(166)- 
5 Leu(170) (B-AChBP_T2); Asn(200)-Thr(203); lso(203) for L-AChBP_T2: Asn(201)- 
Lys(204). 

The amino acids determining binding to the ligand of the nicotinic acetylcholine 
receptor include three stretches on the nAChR alpha subunits. These stretches 
contain amino acids that are conserved throughout the various nAChR alpha 

10 subunits and that are essential for ligand binding. These stretches (corresponding to 
the Torpedo alpha subunit) are (numbering of nAChR a7 as depicted in SEQ ID No. 
9): Trp (108) - Tyr (115), Trp (108) and Tyr (115) being essential; Trp (171) - Tyr 
(173), the amino acids Trp (171) and Tyr (173) being essential; Tyr (210) - Tyr (217), 
the amino acids Tyr (210), Cys (212), Cys (213) and Tyr (217) being essential. In the 

15 chimeric proteins according to the invention, at least the essential amino acids of at 
least one of these stretches haven been substituted for the corresponding amino 
acids. Preferably, the entire stretches have been substituted. 

In a particularly preferred embodiment of the invention, the water-soluble ligand- 
20 binding protein is capable of binding a ligand of an acetylcholine receptor, wherein in 
said protein at least one of the amino acid sequences Trp(101) - Tyr(T108), Trp(162) 
- His(164) and Tyr(204) - Tyr(211) of SEQ ID No. 2 has been exchanged with the 
corresponding sequence of the acetylcholine receptor (L-AChBP_T1&T2 : B- 
AChBP_T1&T2; Trp(101)-Tyr(108): Trp(102)-Tyr(109); Trp(162)-His(164): Trp(163)- 
25 His(165), (B-AChBPJT1) : Trp(163)-Phe(165) (B-AChBP_T2); Tyr(204)-Tyr(21 1 ): 
Tyr(205)-Tyr(212). 

On the basis of homology to the AChBPs, it is possible to change amino acid 
residues in the original amino acid sequence of the ligand-gated ion channel, which 

30 are not critical to ligand-binding or essential for the tertiary and quaternary structure 
of the receptor but could be substituted to amino acid residues which according to 
the AChBP in particular the crystal structure contributes to their water-solubility. As a 
result the ligand-gated ion channel or its ligand-binding domain or the respective 
monomers and pentamers are for example expected to be more easily expressible in 

35 recombinant expression system and more importantly amenable to crystallization, 
allowing the construction of three-dimensional models of their ligand binding 
domains. 
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Thus, in another embodiment the present invention relates to a method for the 
production of a water-soluble ligand-gated receptor or a corresponding ligand-binding 
domain or for improving the water solubility and accessibility to crystallization of such 
a receptor or domain, said method comprising altering the amino acid sequence of 
5 the extracellular domain of a ligand-gated receptor by way of substituting, adding, 
deleting or modifying at least one amino acid at a position corresponding to an amino 
acid determining or contributing to the water-solubility of the above-described water- 
soluble ligand-binding protein of the present invention The method of the invention 
can be performed using conventional techniques known in the art, for example, by 

10 using amino acid deletion(s), insertion(s), substitution(s), addition(s), and/or 
recombination (s) and/or any other modification(s) known in the art either alone or in 
combination. Methods for introducing such modifications in the DNA sequence 
underlying the amino acid sequence of the ligand-binding domain a ligand-gated ion 
channel are well known to the person skilled in the art; see, e.g., Sambrook, 

15 Molecular Cloning A Laboratory Manual, Cold Spring Harbor Laboratory (1989) N.Y. 
The resulting ligand-gated receptor or ligand-binding domain retains comparable in 
vitro and preferably also in vivo ligand-binding activity to that of the ligand-gated ion 
channel, and more importantly, allow complete crystallization of the protein such that 
they may be characterized by X-ray crystallography. The X-ray crystallographic data 

20 can be used for example for identification and construction of possible therapeutic 
compounds in the treatment of various disease conditions. 

As has- been discussed herein before, the ligand-gated ion channel superfamily 
including nACh, 5-HT3, glycine, GABA A , and GABA C receptors as well as 
invertebrate glutamate ion-channels and MOD-1 serotonin channel contain 
25 extracellular ligand binding domains that are homologous to the AChBP. Many of 
these receptors are promising drug targets. Therefore, the ligand-gated receptor to 
be modified is preferably one of those of the mentioned superfamily, most preferably 
it is nAChR. 

Information on the nucleotide and amino acid sequences, structural elements, 
30 functional assays of the nAch, 5-HT3, glycine, GABA A , and GABA C receptors can be 

found in the prior art. For example, the nicotinic receptors at the amino acid level are 

described in Corringer et al., Annu. Rev. Pharmacol. Toxicol. 40 (2000), 431-458. 

Means for retrieving nucleotide and amino acid sequences, performing sequence 

alignments in order to identify the most likely critical amino acid residues are 
35 described below and in the examples; for further general information see the review 

on periplasmic binding protein (PBP), an ancient protein module present in multiple 

drug receptors by Felder et al., PharmSci. 1(2) (1999). 
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In a preferred embodiment of the method of the present invention, said at least one 
amino acid is altered to the corresponding amino acid of the amino acid sequence 
depicted in any one of SEQ ID Nos. 2, 4, 6 or 8, or to a an equivalent amino acid, 

5 preferably in which said solubility-determining amino acids comprise solvent 
accessible regions in the crystal structure according to Figure 10 or 11. Preferred 
amino acid sequence positions and amino acid substitutions are described above for 
the AChBP and can be applied generally in the method of the present invention. 
It is expected that the insertion of the loop Cys123-Cys136 of the mature AChBP 

10 SEQ ID No. 2 into the equivalent region (Cys127-Cys141) in the mature nicotinic <x7 
homopentamer ligand binding domain creates an easily expressed form of this 
protein. Likewise, this loop or an equivalent loop from other water-soluble ligand 
proteins of the present invention can be inserted into the equivalent region of other 
homopentameric ligand binding domains of ligand gated ion channels such as the 

15 glycine receptor and the 5-HT3 receptor to create an easily expressed form of those 
proteins. 

Thus, in one embodiment, the present invention relates to any one of the above 
described methods, wherein loop Cys123-Cys136 of SEQ ID No. 2 is inserted into 
the corresponding region of the ligand binding domain of the ligand-gated receptor. 

20 

The above described water-soluble ligand-gated receptor or a corresponding ligand- 
binding domain are usually prepared by site-directed mutagenesis of the underlying 
encoding polynucleotide. Once the corresponding polynucleotide has been 
generated it can be used to express the altered ligand-gated receptor or a 
25 corresponding ligand-binding domain. Thus, the method of the present invention 
commonly comprises 

(a) culturing a host cell transfected with and capable of expressing a 

polynucleotide comprising a nucleotide sequence encoding the altered 
amino acid sequence; and optionally 
30 (b) recovering said water-soluble ligand-gated receptor or corresponding 
ligand-binding domain from the culture. 
Methods for the expression and purification of the water-soluble ligand-gated 
receptor or corresponding ligand-binding domain of the present invention are 
described further below. Preferably, the expression system described in Examples 4 
35 and 5, or corresponding expression systems are used. 
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The present invention also relates to the a water-soluble ligand-gated receptor and 
ligand-binding domain obtainable by the above described methods of the invention. 
Preferably, said water-soluble ligand-gated receptor exhibits a 10-fold, more 
preferably 100-fold, still more preferably 1000-fold and most preferably 10000-fold 
5 higher solubility in water than the corresponding wild type, preferably human ligand- 
gated receptor. However, improvements in water solubility of about 2 to 5 fold is also 
already advantageous. The average hydrophobicity may be in the range of -1 00 to - 
400. Accordingly, the present invention provides methods for the prediction and 
creation of mutants and chimeras of ligand binding domains, of homopentameric 
10 acetylcholine receptor subtypes and of other homopentameric ion channels with 
increased solubility. 

In one embodiment the water-soluble ligand-binding protein of the invention further 
comprises a spacer sequence allowing coupling with a carrier body. The spacer 

15 sequence may be an amino acid sequence encodable by a polynucleotide or other 
molecule such as polymethylene anchor groups commonly used in chip technology. 
The chimeric protein of the invention may further comprise a spacer sequence, which 
allows coupling of the protein to a carrier body. Such spacer sequence may be e.g. 
an oligo-histidine stretch attached to the C-terminus of the protein. Such an oligo- 

20 histidine stretch is capable of binding to Talon© metal affinity beads or similar 
carriers. Such binding stretches have no detectable influence on the pharmacological 
properties of the proteins. The chimeric proteins according to the invention can be 
used for screening of specific binding of potential drugs, in particular screening for 
modulators of ion-channel opening. Conventional in vitro screening techniques, such 

25 as phage display technology, can be used for this purpose. High-throughput assays, 
possibly in combination with combinatorial chemistry can also be used. Specific 
binding of test compounds to the (immobilised) chimeric proteins of the invention can 
be performed e.g. by competition binding assays using alpha bungarotoxin as a 
competitor. The invention also concerns test kits containing the proteins described 

30 above, together with further means for carrying out a screening test, such as carriers, 
labels, diluents, other chemicals etc. 

In addition, the present invention relates to fusion proteins comprising the water- 
soluble ligand-binding protein of the invention or a binding fragment thereof and a 
35 fragment of a ligand-gated receptor. The term "fusion protein" as used herein refers 
to protein constructs that are the result of combining multiple protein domains or 
linker regions for the purpose of gaining the combined functions of the domains or 
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linker regions. This is may be accomplished by molecular cloning of the nucleotide 
sequences encoding such domains to produce a new polynucleotide sequence that 
encodes the desired fusion protein. Alternatively, creation of a fusion protein may be 
accomplished by chemically joining two proteins. A fusion protein of the present 
5 invention preferably comprises at least the ligand-binding domain of the AChBP or of 
a ligand-gated ion channel, which has been modified in accordance with the above 
described methods. 

Nicotinic acetylcholine receptors are • comprised of five subunits, selected from a 

10 related family of subunit proteins. The neuronal subunits fall into two main types 
depending on the presence or absence of a pair of vicinal cysteines close to the 
binding site for acetylcholine. Thus all a-subunits contain paired cysteine residues 
thought to play a role in binding of nicotinic agonists (Aplin and Wonnacott, 48 (1994), 
473-477), whereas the p-subunits do not. There are ten known alpha subunits, a1 to 

15 a10, and at least four beta subunits, pi to p4. Receptors comprise at least one alpha 
subunit which in some cell types combine with a beta subunit and in some cases a 
gamma, delta and epsilon subunit. For example, the AChR at the neuromuscular 
junction is believed to have an (a1)2piyS stoichiometry. Within the group of a- 
subunits there is marked diversity in the manner in which a complete functional 

20 nAChR is formed. The majority of the a subunits only form functional receptors when 
combined as a heteropentamer with p-subunits in the CNS (McGehee and Role, 
Annual Review of Physiology 57 (1995), 521-546). However, a7, a8 and a9 nAChR 
subunits and the related 5-HT3A subunit are capable of forming functional 
homopentameric receptors. In this respect it is interesting that the phylogenetic 

25 relationship between nAChR subunits suggest that a7, a8, a9 and the related 5-HT3A 
subunit are more related to each other than to the subunits which only form 
heteropentameric receptors. Sequence homologies indicate that the a7, a8 and a9 
subunits form a distinct subgroup of the alpha subunits. 

As is evident form the foregoing, the above described water-soluble ligand-binding 
30 protein or receptor or ligand-binding domain thereof can be used for forming 
complexes of homo- or heteromultimers, such as a dimer, pentamer or decamer 
consisting of at least one monomer of the mentioned proteins of the present 
invention. Preferably, these multimers constitute a function ligand-gated receptor. 
Preferably, said ligand-gated receptor is related to the nAchR. 

35 

The present invention also relates to the production of synthetic heteropentamers 
resembling heteropentameric gated ion-channels by mutation of AChBP, using 
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knowledge of the crystal structure about the primary and secondary contact regions; 
see infra. Preferably, said synthetic heteropentamer's resembles a heteropentameric 
nicotinic acetylcholine receptor. Accordingly, the present invention more generally 
relates to a ligand-gated ion channel comprising any one of the above described 

5 water-soluble ligand-binding proteins or receptors of the invention as a monomer, 
homo- or heterodimer or -pentamer. This method therefore allows the prediction and 
creation of mutants and chimeras of nicotinic acetylcholine receptors and other 
ligand-gated ion channels that are insensitive or more sensitive to toxin binding, e.g. 
bungarotoxin, lophotoxin, conotoxin, and other toxins that inhibit ligand-gated ion 

10 channels. Preferably, said ligand-gated ion channel is less or more sensitive to 
binding of toxins such as bungarotoxin, lophotoxin or conotoxin compared to the wild 
type ligand-gated ion channel. 

Further information and examples how to create chimeric ligand-binding proteins in 
accordance with the present invention is given in Example 10. 

15 

The nucleotide and amino acid sequences of the acetylcholine, 5-HT3, glycine, 
GABA A , and GABA C receptors can be easily retrieved from public database, for 
example from the internet using http://www.ncbi.nlm.nih.gov/Entrez. The citations 
also include a reference to the corresponding publication also reporting on the 

20 functional expression of the respective receptor. 

The use of recombinant acetylcholine-gated ion channels and functionally assays in 
the discovery of putative novel ligands has been described in Cosford, Pharm. Acta 
Helv. (2000), 74(2-3), 125-130. Furthermore, the cell-free expression and functional 
reconstitution of homo-oligomeric a7 nicotinic acetylcholine receptors into planar lipid 

25 bilayers has been reported by Lyford and Rosenberg, J. Biol. Chem. (1999), 
274(36), 25675-25681 " The use of functional assays of cloned and native muscarinic 
acetylcholine receptors for determining the selectivity profile of toxins has been 
described by Olianas et al. (J. Pharmacol. Exp. Ther. 288 (1999), 164-170). A system 
for the evaluation of pharmacological differences and similarities between 5-HT3 

30 receptors stably transfected cells is provided by for example Bruss et al., Naunyn- 
Schmiedebergs Archives of Pharmacology 360 (1999), 225-33. The primary structure 
and functional expression of the 5-HT3 receptor is described in Maricq et al., Science 
254 (1991), 432-437. Likewise, the stable expression of human glycine a1 and a2 
receptor monomers in mouse L(tk-) cells and their use for the study of the physiology 

35 and pharmacology of functional glycine receptors is described in Wick et al., J. 
Neurosci. Methods 87 (1999), 97-103. An example for the measurement of the 
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pharmacology of recombinant GABA A receptor subtypes is described in Simpson et 
al., J. Neurosci. Methods 99 (2000), 91-100. Further examples for assay systems are 
given below. 

The described methods as well as others known to the person skilled in the art can 
5 be used for example to 

(1) express and characterise the water-soluble ligand-binding proteins and ligand- 
gated ion channels of the present invention; and 

(2) use stably transfected cells expressing the above described ligand-gated ion 
channels for the identification of novel ligands. 

10 

The present invention also relates to polynucleotides encoding the water-soluble 
ligand-binding proteins and ligand-gated ion channels of the present invention, and 
multimers thereof, preferably dimers or pentamers. Such polynucleotide may be a 
DNA such as a cDNA, or an RNA such as mRNA or any other form of nucleic acid 

15 including synthetic or modified derivatives and may encode the polypeptide in a 
continuous sequence or in a number of sequences interrupted by intervening 
sequences. In which ever form it is present, the polynucleotide is an isolated 
polynucleotide in that it is removed from its naturally-occurring state. This aspect of 
the invention is based on the cloning of the cDNA for ligand-binding proteins. In a 

20 preferred embodiment, the polynucleotide comprises the nucleotide sequence of any 
one of SEQ ID Nos. 1, 3, 5 or 7, optionally including one or more mutations or 
deletions which do not substantially affect the activity of the polypeptide encoded 
thereby. Such mutations include those arising from the degeneracy of the genetic 
code, as well as those giving rise to any of the amino acid mutations or deletions 

25 discussed above. The polynucleotides of the invention preferably comprise 

(a) a nucleotide sequence having at least 15 continuous nucleotides of 
the nucleotide sequence depicted in any one of SEQ ID Nos. 1 , 3, 5 or 
7 or a degenerated nucleotide sequence thereof; or 

(b) a nucleotide sequence capable of hybridizing to a nucleotide 
30 sequence of (a) under stringent hybridisation conditions. 

Typically, selective hybridization will occur when there is at least about 55% 
sequence identity - preferably at least about 65%, more preferably at least about 
75%, and most preferably at least about 90% - over a stretch of at least about 14 
35 nucleotides; see, e.g., Kanehisa, Nucleic Acids Res. 12 (1984), 203-213, herein 
incorporated by reference. Nucleic acid hybridization will be affected by such 
conditions as salt concentration, temperature, solvents, the base composition of the 



WO 01/58951 PCT/EP01/01457 

-17 

hybridizing species, length of the complementary regions, and the number of 
nucleotide base mismatches between the hybridizing nucleic acids, as will be readily 
appreciated by those skilled in the art. 

"Stringent hybridization conditions" and "stringent wash conditions" in the context of 
5 nucleic acid hybridization experiments depend upon a number of different physical 
parameters. The most important parameters include temperature of hybridization, 
base composition of the nucleic acids, salt concentration and length of the nucleic 
acid. One having ordinary skill in the art knows how to vary these parameters to 
achieve a particular stringency of hybridization. In general, "stringent hybridization" 
1 0 is performed at about 25°C below the thermal melting point (T m) for the specific 
DNA hybrid under a particular set of conditions. 

"Stringent washing" is performed at temperatures about 5°C lower than the Tm for 
the specific DNA hybrid under a particular set of conditions. The Tm is the 
temperature at which 50% of the target sequence hybridizes to a perfectly matched 

15 probe; see Sambrook et al M page 9.51, hereby incorporated by reference. The Tm 
for a particular DNA-DNA hybrid can be estimated by the formula: 
Tm = 81.5°C + 16.6 (log10[Na+ ]) + 0.41 (fraction G + C) - 0.63- (% formamide) - 
(600/I) where I is the length of the hybrid in base pairs. 
The Tm for a particular RNA-RNA hybrid can be estimated by the formula: 

20 Tm = 79.8°C + 18.5 (log10[Na+ ]) + 0.58 (fraction G + C) + 11.8 (fraction G + C)2 - 
0.35 (% formamide) - (820/I). 

The Tm for a particular RNA-DNA hybrid can be estimated by the formula: 

Tm = 79.8°C + 18.5(log10[Na+ ]) + 0.58 (fraction G + C) + 11.8 (fraction G + C)2 - 

0.50 (% formamide) - (820/I). 

25 In general, the Tm decreases by 1-1. 5°C for each 1% of mismatch between two 
nucleic acid sequences. Thus, one having ordinary skill in the art can alter 
hybridization and/or washing conditions to obtain sequences that have higher or 
lower degrees of sequence identity to the target nucleic acid. For instance, to obtain 
hybridizing nucleic acids that contain up to 10% mismatch from the target nucleic 

30 acid sequence, 10-15°C would be subtracted from the calculated Tm of a perfectly 
matched hybrid, and then the hybridization and washing temperatures adjusted 
accordingly. Probe sequences may also hybridize specifically to duplex DNA under 
certain conditions to form triplex or other higher order DNA complexes. The 
preparation of such probes and suitable hybridization conditions are well known in 

35 the art. An example of stringent hybridization conditions for hybridization of 
complementary nucleic acid sequences having more than 100 complementary 
residues on a filter in a Southern or Northern blot or for screening a library is 50% 
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formamide/6X SSC at 42°C for at least ten hours. Another example of stringent 
hybridization conditions is 6X SSC at 68°C for at least ten hours. An example of low 
stringency hybridization conditions for hybridization of complementary nucleic acid 
sequences having more than 100 complementary residues on a filter in a Southern or 
5 northern blot or for screening a library is 6X SSC at 42°C for at least ten hours. 
Hybridization conditions to identify nucleic acid sequences that are similar but not 
identical can be identified by experimentally changing the hybridization temperature 
from 68°C to 42°C while keeping the salt concentration constant (6X SSC), or 
keeping the hybridization temperature and salt concentration constant (e.g. 42°C and 

10 6X SSC) and varying the formamide concentration from 50% to 0%. Hybridization 
buffers may also include blocking agents to lower background. These agents are 
well-known in the art; see Sambrook et al., pages 8.46 and 9,46-9.58, herein 
incorporated by reference. Wash conditions also can be altered to change stringency 
conditions. An example of stringent wash conditions is a 0.2x SSC wash at 65°C for 

15 15 minutes (see Sambrook et al., for SSC buffer). Often the high stringency wash is 
preceded by a low stringency wash to remove excess probe. An exemplary medium 
stringency wash for duplex DNA of more than 100 base pairs is 1x SSC at 45°C for 
15 minutes. An exemplary low stringency wash for such a duplex is 4x SSC at 40°C 
for 15 minutes. In general, signal-to-noise ratio of 2x or higher than that observed for 

20 an unrelated probe in the particular hybridization assay indicates detection of a 
specific hybridization. 

By the provision of the nucleotide sequences of SEQ ID Nos. 1 , 3, 5 and 7 as well as 
those encoding the amino acid sequences depicted in SEQ ID Nos. 2, 4, 6 and 8 it is 
possible to isolate identical or similar nucleic acid molecules which encode water- 

25 soluble ligand-binding proteins from other species or organisms, in particular 
orthologous water-soluble ligand-binding protein encoding genes from mammals. 
The term "orthologous" as used herein means homologous sequences in different 
species that arose from a common ancestor gene during speciation. Orthologous 
genes may or may not be responsible for a similar function; see, e.g., the glossary of 

30 the "Trends Guide to Bioinformatics", Trends Supplement 1 998, Elsevier Science. 

In a further aspect, the present invention provides a recombinant polynucleotide 
comprising a vector incorporating the polynucleotide of the present invention. Many 
suitable vectors are known to those skilled in molecular biology, the choice of which 
35 would depend on the function desired and include plasmids, cosmids, viruses, 
bacteriophages and other vectors used conventionally in genetic engineering. 
Methods which are well known to those skilled in the art can be used to construct 
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various plasmids and vectors; see, for example, the techniques described in 
Sambrook, Molecular Cloning A Laboratory Manual, Cold Spring Harbor Laboratory 
(1989) N.Y. and Ausubel, Current Protocols in Molecular Biology, Green Publishing 
Associates and Wiley Interscience, N.Y. (1989), (1994). Alternatively, the 
5 polynucleotides and vectors of the invention can be reconstituted into liposomes for 
delivery to target cells. As discussed in further details below, a cloning vector was 
used to isolate individual sequences of DNA. Relevant sequences can be transferred 
into expression vectors where expression of a particular polypeptide is required. 
Typical cloning vectors include pBscpt sk, pGEM, pUC9, pBR322 and pGBT9. 
10 Typical expression vectors include pTRE, pCAL-n-EK, pESP-1, pOP13CAT, pET, 
pGEX, pMALC, pPIC9, pBac. 

Hence, in a preferred embodiment of the present invention the above-described 
polynucleotides either alone or present in a vector are linked to control sequences 
15 which allow the expression of the polynucleotide in prokaryotic and/or eukaryotic 
cells. 

The term "control sequence" refers to regulatory DNA sequences which are 
necessary to effect the expression of coding sequences to which they are ligated. 
The nature of such control sequences differs depending upon the host organism. In 

20 prokaryotes, control sequences generally include promotor, ribosomal binding site, 
and terminators. In eukaryotes generally control sequences include promotors, 
terminators and, in some instances, enhancers, transactivators or transcription 
factors. The term "control sequence" is intended to include, at a minimum, all 
components the presence of which are necessary for expression, and may also 

25 include additional advantageous components. 

The term "operably linked" refers to a juxtaposition wherein the components so 
described are in a relationship permitting them to function in their intended manner. A 
control sequence "operably linked" to a coding sequence is ligated in such a way that 
expression of the coding sequence is achieved under conditions compatible with the 

30 control sequences. In case the control sequence is a promotor, it is obvious for a 
skilled person that double-stranded nucleic acid is preferably used. 
Thus, the vector of the invention is preferably an expression vector. An "expression 
vector" is a construct that can be used to transform a selected host cell and provides 
for expression of a coding sequence in the selected host. Expression vectors can for 

35 instance be cloning vectors, binary vectors or integrating vectors. Expression 
comprises transcription of the nucleic acid molecule preferably into a translatable 
mRNA. Regulatory elements ensuring expression in prokaryotic and/or eukaryotic 
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cells are well known to those skilled in the art. In the case of eukaryotic cells they 
comprise normally promotors ensuring initiation of transcription and optionally poly-A 
signals ensuring termination of transcription and stabilization of the transcript. 
Possible regulatory elements permitting expression in prokaryotic host cells 

5 comprise, e.g., the PL, lac, trp, T7 or tac promotor in E. coli, and examples of 
regulatory elements permitting expression in eukaryotic host cells are the AOX1 or 
GAL1 promotor in yeast or the CMV-, SV40-, RSV-promotor (Rous sarcoma virus), 
CMV-enhancer, SV40-enhancer or a globin intron in mammalian and other animal 
cells. In this context, suitable expression vectors are known in the art such as 

10 Okayama-Berg cDNA expression vector pcDV1 (Pharmacia), pCDM8, pRc/CMV, 
pcDNAI , pcDNA3 (In-vitrogene), pSPORTI (GIBCO BRL). An alternative expression 
system which could be used to express the protein is an insect system. In one such 
system, Autographa californica nuclear polyhedrosis virus (AcNPV) is used as a 
vector to express foreign genes in Spodoptera frugiperda cells or in Trichoplusia 

15 larvae. The coding sequence of a nucleic acid molecule of the invention may be 
cloned into a nonessential region of the virus, such as the polyhedrin gene, and 
placed under control of the polyhedrin promotor. Successful insertion of said coding 
sequence will render the polyhedrin gene inactive and produce recombinant virus 
lacking coat protein coat. The recombinant viruses are then used to infect S. 

20 frugiperda cells or Trichoplusia larvae in which the protein of the invention is 
expressed (Smith, J. Virol. 46 (1983), 584; Engelhard, Proc. Nat. Acad. Sci. USA 91 
(1994), 3224-3227). 

In plants, promotors commonly used are the polyubiquitin promotor, and the actin 
promotor for ubiquitous expression. The termination signals usually employed are 

25 from the Nopaline Synthase promotor or from the CAMV 35S promotor. A plant 
translational enhancer often used is the TMV omega sequences, the inclusion of an 
intron (Intron- 1 from the Shrunken gene of maize, for example) has been shown to 
increase expression levels by up to 100-fold. (Mait, Transgenic Research 6 (1997), 
143-156; Ni, Plant Journal 7 (1995), 661-676). Additional regulatory elements may 

30 include transcriptional as well as translational enhancers. Advantageously, the 
above-described vectors of the invention comprises a selectable and/or scorable 
marker. Selectable marker genes useful for the selection of transformed cells and, 
e.g., plant tissue and plants are well known to those skilled in the art and comprise, 
for example, antimetabolite resistance as the basis of selection for dhfr, which 

35 confers resistance to methotrexate (Reiss, Plant Physiol. (Life Sci. Adv.) 13 (1994), 
143-149); npt, which confers resistance to the aminoglycosides neomycin, 
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kanamycin and paromycin (Herrera-Estrella, EMBO J. 2 (1983), 987-995) and hygro, 
which confers resistance to hygromycin (Marsh, Gene 32 (1984), 481-485). 
Useful scorable markers are also known to those skilled in the art and are 
commercially available. Advantageously, said marker is a gene encoding luciferase 
(Giacomin, PI. Sci. 116 (1996), 59-72; Scikantha, J. Bact. 178 (1996), 121), green 
fluorescent protein (Gerdes, FEBS Lett. 389 (1996), 44-47) or 3-glucuronidase 
(Jefferson, EMBO J. 6 (1987), 3901-3907). This embodiment is particularly useful for 
simple and rapid screening of cells, tissues and organisms containing a vector of the 
invention. 

The proteins can be recovered and purified from recombinant cell cultures by well- 
known methods including ammonium sulfate or ethanol precipitation, acid extraction, 
anion or cation exchange chromatography, phosphocellulose chromatography, 
hydrophobic interaction chromatography, size exclusion chromatography, affinity 
chromatography, hydroxylapatite chromatography and lectin chromatography. Most 
preferably, high performance liquid chromatography ("HPLC") or FPLC is employed 
for purification. 

The present invention furthermore relates to host cells produced by introducing a 
nucleic acid molecule into the host cell which upon its presence in the cell mediates 
the expression of a gene encoding water-soluble ligand-binding proteins or 
comprising a polynucleotide or a vector as described above or a polynucleotide 
according to the invention wherein the polynucleotides and/or nucleic acid molecule 
is foreign to the host cell. By "foreign 0 it is meant that the polynucleotide or nucleic 
acid molecule is either heterologous with respect to the host cell, this means derived 
from a cell or organism with a different genomic background, or is homologous with 
respect to the host cell but located in a different genomic environment than the 
naturally occurring counterpart of said nucleic acid molecule. This means that, if the 
nucleic acid molecule is homologous with respect to the host cell, it is not located in 
its natural location in the genome of said host cell, in particular it is surrounded by 
different genes. In this case the polynucleotide may be either under the control of its 
own promotor or under the control of a heterologous promotor. The vector or nucleic 
acid molecule according to the invention which is present in the host cell may either 
be integrated into the genome of the host cell or it may be maintained in some form 
extrachromosomally. In this respect, it is also to be understood that the nucleic acid 
molecule of the invention can be used to restore or create a mutant gene via 
homologous recombination. 



WQ_01/58?51_ 



Page 23 of 259 



WO 01/58951 PCT/EP01/01457 

-22 

The host cell can be any prokaryotic or eukaryotic cell, such as bacterial, insect, 
fungal, plant or animal cells. 

The term "prokaryotic" is meant to include all bacteria which can be transformed or 
transfected with a DNA or RNA molecules for the expression of a' protein of the 

5 invention. Prokaryotic hosts may include gram negative as well as gram positive 
bacteria such as, for example, E. coli, S. typhimurium, Serratia marcescens and 
Bacillus subtilis. The term "eukaryotic" is meant to include yeast, higher plant, insect 
and preferably mammalian cells. Depending upon the host employed in a 
recombinant production procedure, the protein encoded by the polynucleotide of the 

10 present invention may be glycosylated or may be non-glycosylated. The water- 
soluble ligand-binding protein of the invention may or may not also include an initial 
methionine amino acid residue. A polynucleotide of the invention can be used to 
transform or transfect the host using any of the techniques commonly known to those 
of ordinary skill in the art. Furthermore, methods for preparing fused, operably linked 

15 genes and expressing them in, e.g., mammalian cells and bacteria are well-known in 
the art (Sambrook, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY, 1 989). 

Thus the present invention provides a cell capable of expressing a polypeptide as 
discussed herein. The cell comprises a recombinant host cell usually incorporating 
20 the polynucleotide. Preferably, the host cell incorporates the polynucleotide as the 
recombinant polynucleotide. Any suitable host cell may be chosen, again depending 
on the intended purpose. Suitable host cells include XLI-BLUE, B21(DE3)pLysS, 
HB101, SOLR and SP-Q01 (Saccharomyces pombe). 

Using an appropriate combination of host cell, vector and polynucleotide, an 
25 expression system can be provided so as to obtain a polypeptide useful in the 
present invention. This may comprise a fusion polypeptide encoded by the 
recombinant polynucleotide, a part of which is encoded by the vector. Typically, the 
vector will have a promoter region, which is usually inducible, leading to 5' coding 
region associated with the promotor. By appropriate manipulation, the polynucleotide 
30 encoding the polypeptide can be attached to the 5' coding region in frame. In this 
way, expression of the nucleotide sequence downstream of the promotor region 
gives rise to the fusion polypeptide which includes the polypeptide of the present 
invention. 

35 The present invention also relates to an antigen comprising an epitope of at least 5 
continuous amino acids of the amino acid sequence depicted in any one of SEQ ID 
Nos. 2, 4, 6 or 8 and/or said epitope is detectable by a monoclonal or polyclonal 
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antibody which recognises, preferably with a. binding affinity of at least 10* 7 M, a 
protein of the invention as described above. In the present invention, "epitopes" 
refers to fragments of the AChBP of the invention having antigenic or immunogenic 
activity in an animal. A preferred embodiment of the present invention relates to 
5 antigens comprising an epitope, as well as the polynucleotide encoding this 
fragment A region of a protein molecule to which an antibody can bind is defined as 
an "antigenic epitope." In contrast, an "immunogenic epitope" is defined as a part of a 
protein that elicits an antibody response; see, for instance, Geysen, Proc. Natl. Acad. 
Sci. USA 81 (1983); 3998-4002. Fragments which function as epitopes may be 

10 produced by any conventional means; see, e.g., Houghten, Proc. Natl. Acad. Sci. 
USA 82 (1985), 5131-5135 further described in U.S. Patent No. 4,631,211. In the 
present invention, antigenic epitopes preferably contain a sequence of at least five, 
six, seven, more preferably at least nine, and most preferably between about 1 5 to 
about 30 amino acids. Antigenic epitopes are useful to raise antibodies, Including 

15 monoclonal antibodies, that specifically bind the epitope; see, for instance, Wilson, 
Cell 37 (1984), 767-778; Sutcliffe, Science 219 (1983), 660-666). Similarly, 
immunogenic epitopes can be used to induce antibodies according to methods well 
known in the art; see, for instance, Sutcliffe, supra; Wilson, supra; Chow, Proc. Natl. 
Acad. Sci. USA 82 (1985), 910-914; and Bittle, J. Gen. Virol. 66 (1985); 2347-2354. 

20 A preferred immunogenic epitope includes the soluble protein. The immunogenic 
epitopes may be presented together with a carrier protein, such as an albumin, to an 
animal system (such as rabbit or mouse) or, if it is long enough (at least about 25 
amino acids), without a carrier. However, immunogenic epitopes comprising as few 
as 8 to 10 amino acids have been shown to be sufficient to raise antibodies capable 

25 of binding to, at the very least, linear epitopes in a denatured polypeptide (e.g., in 
Western blotting.) 

The present invention also relates to antibodies specifically recognizing the water- 
soluble ligand-binding protein and ligand-gated ion channels of the present invention, 

30 In particular recognizing the above described antigen or epitope. As used herein, the 
term "antibody" (Ab) or "monoclonal antibody" (Mab) is meant to include intact 
molecules as well as antibody fragments (such as, for example, Fab and F(ab') 2 
fragments) which are capable of specifically binding to protein. Fab and F(ab') 2 
fragments lack the Fc fragment of intact antibody, clear more rapidly from the 

35 circulation, and may have less non-specific tissue binding than an intact antibody; 
see, e.g., Wahl, J. Nucl. Med. 24 (1983), 316-325. Thus, these fragments are 
preferred, as well as the products of a FAB or other immunoglobulin expression 



WO 01/58951 PCT/EP01/01457 

-24 

library. Moreover, antibodies of the present invention include chimeric, single chain, 
and humanized antibodies; see also infra. Said antibody can be a monoclonal 
antibody, a polyclonal antibody, a single chain antibody, human or humanized 
antibody, primatized, chimerized or fragment thereof that specifically binds said 
peptide or polypeptide also including bispecific antibody, synthetic antibody, antibody 
fragment, such as Fab, Fv or scFv fragments etc., or a chemically modified derivative 
of any of these. The general methodology for producing antibodies is well-known and 
has been described in, for example, Kohler and Milstein, Nature 256 (1975), 494 and 
reviewed in J.G.R. Hurrel, ed., "Monoclonal Hybridoma Antibodies: Techniques and 
Applications", CRC Press Inc., Boco Raron, FL (1982), as well as that taught by L T. 
Mimms et al., Virology 176 (1990), 604-619. Furthermore, antibodies or fragments 
thereof to the aforementioned peptides can be obtained by using methods which are 
described, e.g., in Harlow and Lane "Antibodies, A Laboratory Manual", CSH Press, 
Cold Spring Harbor, 1988. For the production of antibodies in experimental animals, 
various hosts including goats, rabbits, rats, mice, and others, may be immunized by 
injection with polypeptides of the present invention or any fragment or oligopeptide or 
derivative thereof which has immunogenic properties. Techniques for producing and 
processing polyclonal antibodies are known in the art and are described in, among 
others, Mayer and Walker, eds., "Immunochemical Methods in Cell and Molecular 
Biology", Academic Press, London (1987). Polyclonal antibodies also may be 
obtained from an animal, preferably a mammal, previously infected with the virus of 
the invention. Methods for purifying antibodies are known in the art and comprise, for 
example, immunoaffinity chromatography. Depending on the host species, various 
adjuvants or immunological carriers may be used to increase immunological 
responses. Such adjuvants include, but are not limited to, Freund's, complete or 
incomplete adjuvants, mineral gels such as aluminium hydroxide, and surface active 
substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions 
and dinitrophenol. An example of a carrier, to which, for instance, a peptide of the 
invention may be coupled, is keyhole limpet hemocyanin (KLH). When derivatives of 
said antibodies are obtained by the phage display technique, surface plasmon 
resonance as employed in the BIAcore system can be used to increase the efficiency 
of phage antibodies which bind to an epitope of the peptide or polypeptide of the 
invention (Schier, Human Antibodies Hybridomas 7 (1996), 97-105; Malmborg, J. 
Immunol. Methods 183 (1995), 7-13). In many cases, the binding phenomena of 
antibodies to antigens is equivalent to other ligand/anti-ligand binding. 
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In another embodiment the present invention relates to an oligonucleotide probe 
comprising a nucleotide sequence having at least 1 5 continuous nucleotides of a 
polynucleotide of the invention and/or encoding the above described antigen. Such 
oligonucleotides will usually specifically hybridize to a polynucleotide encoding a 
5 water-soluble ligand-binding protein of the invention. Specific hybridization occurs 
preferably under stringent conditions and implies no or very little cross-hybridization 
with nucleotide sequences encoding no or substantially different proteins. Such 
nucleic acid molecules may be used as probes and/or for the control of gene 
expression. Nucleic acid probe technology is well known to those skilled in the art 

1 0 who will readily appreciate that such probes may vary in length. Preferred are nucleic 
acid probes of 1 7 to 35 nucleotides in length. Of course, it may also be appropriate to 
use nucleic acids of up to 100 and more nucleotides in length. The nucleic acid 
probes of the invention are useful for various applications. On the one hand, they 
may be used as PCR primers for amplification of polynucleotides according to the 

15 invention. Another application is the use as a hybridization probe to identify 
polynucleotides hybridizing to the polynucleotides of the invention by homology 
screening of genomic DNA libraries. Nucleic acid molecules according to this 
preferred embodiment of the invention which are complementary to a polynucleotide 
as described above may also be used for repression of expression of a gene 

20 comprising such a polynucleotide, for example due to an antisense or triple helix 
effect or for the construction of appropriate ribozymes (see, e.g., EP-B1 0 291 533, 
EP-A1 0 321 201, EP-A2 0 360 257) which specifically cleave the (pre)-mRNA of a 
gene comprising a polynucleotide of the invention. Selection of appropriate target 
sites and corresponding ribozymes can be done as described for example in 

25 Steinecke, Ribozymes, Methods in Cell Biology 50, Galbraith et al. eds Academic 
Press, Inc. (1995), 449-460. Standard methods relating to antisense technology have 
also been described (Melani, Cancer Res. 51 (1991), 2897-2901). Said nucleic acid 
molecules may be chemically synthesized or transcribed by an appropriate vector 
containing a chimeric gene which allows for the transcription of said nucleic acid 

30 molecule in the cell. Such nucleic acid molecules may further contain ribozyme 
sequences as described above. 

In this respect, it is also to be understood that the polynucleotide of the invention can 
be used for "gene targeting" and/or "gene replacement", for restoring a mutant gene 
or for creating a mutant gene via homologous recombination; see for example 
35 Mouellic, Proc. Natl. Acad. Sci. USA, 87 (1990), 4712-471 6; Joyner, Gene Targeting, 
A Practical Approach, Oxford University Press. 
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Furthermore, the person skilled in the art is well aware that it is also possible to label 
such a nucleic acid probe with an appropriate marker for specific applications, such 
as for the detection of the presence of a polynucleotide of the invention in a sample 
derived from an organism, in particular mammals, preferably human. A number of 
5 companies such as Pharmacia Biotech (Piscataway NJ), Promega (Madison Wl), 
and US Biochemical Corp (Cleveland OH) supply commercial kits and protocols for 
these procedures. Suitable reporter molecules or labels include those radionuclides, 
enzymes, fluorescent, chemiluminescent, or chromogenic agents as well as 
substrates, cofactors, inhibitors, magnetic particles and the like. Patents teaching the 
10 use of such labels include US Patents US-A-3,81 7,837; US-A-3,850,752; US-A- 
3,939,350; US-A-3,996,345; US-A-4,227,437; US-A-4,275,149 and US-A-4,366,241 . 
Also, recombinant immunoglobulins may be produced as shown in US-A-4,81 6,567 
incorporated herein by reference. 

Furthermore, the so-called "peptide nucleic acid" (PNA) technique can be used for 
15 the detection or inhibition of the expression of a polynucleotide of the invention. For 
example, the binding of PNAs to complementary as well as various single stranded 
RNA and DNA nucleic acid molecules can be systematically investigated using 
thermal denaturation and BIAcore surface-interaction techniques (Jensen, 
Biochemistry 36 (1997), 5072-5077). 

20 

The present invention also relates to a method for the production of a transgenic non- 
human animal, preferably transgenic mouse, comprising introduction of a 
polynucleotide or vector of the invention into a germ cell, an embryonic cell, stem cell 
or an egg or a cell derived therefrom. The non-human animal can be used in 

25 accordance with a screening method of the invention described herein. Production of 
transgenic embryos and screening of those can be performed, e.g., as described by 
A. L Joyner Ed., Gene Targeting, A Practical Approach (1993), Oxford. University 
Press. The DNA of the embryonal membranes of embryos can be analyzed using, 
e.g., Southern blots with an appropriate probe; see supra. The invention also relates 

30 to transgenic non-human animals such as transgenic mouse, rats, hamsters, dogs, 
monkeys, rabbits, pigs, C. elegans and fish such as Torpedo fish comprising a 
polynucleotide or vector of the invention or obtained by the method described above, 
preferably wherein said polynucleotide or vector is stably integrated into the genome 
of said non-human animal, preferably such that the presence of said polynucleotide 

35 or vector leads to the expression of the water-soluble protein of the present invention. 
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The present Invention further relates to composition comprising any one of the above 
described water-soluble ligand-binding proteins, multimers such as dimers or 
pentamers thereof, ligand-gated ion channels, polynucleotides, vectors, host cells, 
antigens, antibodies, or oligonucleotide probes of the invention; and optionally 
5 suitable means for detection or performing a ligand-receptor binding assay. In this 
context, the present invention also relates to a method for identifying an 
agonist/activator or antagonist/inhibitor of a ligand-gated receptor comprising the 
steps of: 

(a) contacting the water-soluble ligand-binding protein of the present invention, 
10 multimers such as dimers or pentamers thereof, or the ligand-gated ion 

channel of the invention or a cell expressing said protein in the presence of 
components capable of providing a detectable signal in response to ligand 
binding with a compound to be screened under conditions that permit binding 
of said compound to the ligand-binding protein; and 
15 (b) detecting the presence or absence of a signal generated from the binding 
activity of the ligand-binding protein, wherein the presence/increase and 
absence/decrease of the signal is indicative for an agonist/activator and 
antagonist/inhibitor, respectively, of a ligand-gated receptor. 

Since ligand-gated receptors are modulated allosterically by natural polyamines, 
such as spermine, and by polyamine derivatives, such as polyamine amides (e.g. 
philanthotoxin-343) and polymethylene tetraamines (e.g. methoctramine) 
(Usherwood, Farmaco. 55 (2000), 202-205) compounds comprising or based on 
such entities may be used as starting material for screening. An antagonist or agonist 
that "modulates the activity" of a polypeptide and causes an altered signal, for 
example response in the cell refers to a compound that alters the activity of the 
protein so that it behaves differently in the presence of the compound than in the 
absence of the compound. Typically, the effect of an antagonist is observed as a 
blocking of agonist-induced receptor activation. Antagonists include competitive as 
well as non-competitive antagonists. A competitive antagonist (or competitive 
blocker) interacts with or near the site specific for agonist binding. A non-competitive 
antagonist or blocker inactivates the function of the receptor by interacting with a site 
other than the agonist interaction site. As understood by those of skill in the art, 
bioassay methods for identifying compounds that modulate the activity of receptors 
such as proteins of the invention generally require comparison to a control. One type 
of "control" is a cell or culture that is treated substantially the same as the test cell or 
test culture exposed to the compound, with the distinction that the "control" cell or 
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culture is not exposed to the compound. For example, in methods that use voltage 
clamp electrophysiological procedures, the same cell can be tested in the presence 
or absence of compound, by merely changing the external solution bathing the cell. 
Accordingly, the response of the transfected cell to the "control" cell or culture to the 
5 same compound under the same reaction conditions. However, "control data" can 
also be used from the literature. 



As described in Example 6 the 3-dimensional structure of AChBP could be solved by 
X-ray crystallography at 2.7A resolution (current Rfactor = 27.9 %, Rfree = 30.0 %). 

10 In crystals, as in solution, AChBP forms a stable homo-pentamer with dimensions 
comparable to those of the ligand-binding domain of ligand-gated ion channels, in 
particular comparable to the nAChR, as determined in EM studies by Unwin and 
coworkers; see supra. The structural analysis revealed that in the AChBP 
homopentamer the monomers have immunoglobulin-like topology. At each of five 

15 subunit interfaces a ligand-binding site is located, with all residues consistent with 
biochemical data. In this site a buffer molecule (HERPES) stacks with cation-rc 
interactions on a tryptophan, resembling acetylcholine binding. The AChBP structure 
is relevant for the development of drugs against, e.g., Alzheimer's disease and 
nicotine addiction. The high-resolution crystal structure of AChBP, along with 

20 biochemical and pharmacological data, supports the teaching of the present 
invention that the water-soluble ligand-binding proteins of the invention such as 
AChBP are good mimics of ligand-binding domains of ligand-gated ion channels. 

Thus, the present invention relates to a crystal of a water-soluble ligand-binding 
25 protein of the invention, preferably in a multimeric form such as dimer, pentamer or 
decamer. In one embodiment said crystal comprises a protein-ligand complex. 
Methods how to employ and analyze such crystals are known to the person skilled in 
the art; see for example US-A-5,872,01 1 which describes the crystal structure of a 
protein-ligand complex containing an N-terminal truncated elF4E and uses thereof. 
30 The crystal structure of the ligand-gated receptor ligand-binding region in a complex 
with a ligand, preferably being an antagonist or agonist will reveal the determinants of 
receptor-antagonist/agonist interactions and how ligand-binding specificity and 
affinity are altered by remote residues and the redox state of the conserved 
disulphide bond. The structure may also indicate mechanisms for allosteric effector 
35 action and for ligand-induced channel gating. How the information on the crystal 
structure of a ligand-binding region in a complex with a ligand can be used for the 
development of agonists and antagonists has been described for the structure of a 
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glutamate-receptor ligand-binding core in complex with kainate (Armstrong et al., 
Nature 395 (1998), 913-917). 

The crystal of the invention, in particular when comprising nAChR related proteins 
5 can be a complex of the protein with a ligand comprising an N-alkytated hydroxyalkyl 
and/or a quaternary ammonium ion. However, other ligands my be used as well. 
Preferred ligands comprise 4-(2-Hydroxyethyl)-1 -piperazineethanesulfonic acid 
(HEPES), B-bippinatin, lophotoxin, d-tubocurarine, nicotine, acetylcholine, conotoxin, 
carbamylcholine, galanthamine, epibatidine or alpha-bungarotoxin or derivatives 
10 thereof. 

Different aspects of X-ray crystallography are such as data collection, structure 
solution, determining the molecular structure from X-ray diffraction, refinement, etc. 
are described in the prior art, see, e.g., Powell, Annu. Rep. Prog. Chera, Sect. C: 
15 Phys. Chem. 96 (2000), 139-175 and Methods in Enzymology, 276-277, edited by 
Carter and Sweet, Academic Press, 1997. Current methods and optimization 
algorithms for the refinement of X-ray crystal structures are described by Van Der 
Maelen Uria, Crystallogr. Rev. 7 (1999), 125-180. 

The crystal of the invention effectively diffracts X-rays for the determination of the 
20 atomic coordinates of the protein or protein-ligand complex to a resolution of greater 
than 5.0, preferably greater than 4.0 Angstroms. In a preferred embodiment the 
crystal effectively diffracts X-rays for the determination of the atomic coordinates of 
the protein-ligand complex to a resolution of greater than 3.0 Angstroms. In a more 
' preferred embodiment the crystal effectively diffracts X-rays for the determination of 
25 the atomic coordinates of the protein-ligand complex to a resolution of greater than 
2.0 Angstroms. In one embodiment the crystal effectively diffracts X-rays for the 
determination of the atomic coordinates of the protein-ligand complex to a resolution 
of about 2.7 Angstroms. 

30 Preferably, the crystal of the invention is formed by a protein that has an amino acid 
sequence of amino adds 20 to 223 of SEQ ID No. 2, or an amino acid sequence that 
differs from amino acid 20 to 223 of SEQ ID No. 2 by only having conservative 
substitutions. As is described in the examples, the crystals of the AChBP comprise 
decameric forms of the protein. In order to ease the use of the AChBP protein for 

35 analysis and crystallography it is envisaged to create a mutation in residue Asp2 and 
Asp5 of the mature AChBP SEQ ID No. 2 or 4 to remove the calcium binding site, 
and prevent creation of a decamer. This deletion can be done for example by 
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oligonucleotide-directed mutagenesis. Alternatively crystals could be grown in a low 
calcium concentration or in the absence of calcium. 

The crystal of the present invention preferably has (1) a space group of P2 i 2 1 2 1 and 
a unit cell of dimensions of a=120.6A, b=137.0A and c=161 .5A; (2) a space group of 
P4 2 2^2 and a unit cell of dimensions of a=b=141.6A and c=120.8A or (3) a space 
group of R2i and a unit cell of dimensions of a=121.lA, b=162.lA, c=139.4A, 
(3=90.1°. 

The crystal of the present invention is preferably from a protein that has secondary 
structural elements that include .alpha.-helix and antiparallel .beta-sheets as shown 
in and described for Figures 7, 10, 11 and/or 12. Most preferably, the crystal of the 
invention has a three-dimensional structure as defined by atomic coordinates shown 
in Table 1 . Those of skill in the art understand that a set of structure coordinates 
determined by X-ray crystallography is not without standard error. For the purpose of 
this invention, any set of structure coordinates for AChBP or AChBP mutants that 
have a root mean square deviation of protein backbone atoms (N, C.alpha., C and O) 
of less than 0.75 Angstrom when superimposed - using backbone atoms - on the 
structure coordinates listed in Table 1 shall be considered identical. 
In a most preferred embodiment of the present invention, the crystal has a binding 
cavity as shown in Figures 6, 8, 9 and/or 13. 

In accordance with the findings of the present invention, it is proposed to use the 
water-soluble ligand-binding proteins of molluscs as the blueprint for the receptor 
binding site of the ligand-gated ion channel superfamily including nACh, 5-HT3, 
glycine, GABA A , and GABA Cj most preferably for the nAChR. The availability of X-ray 
structures, and the cloned sequences provide a unique opportunity to understand 
these receptors at the molecular level, possibly unravel the dynamic changes 
occurring upon ligand binding, and predict their tertiary and quaternary structure with 
a higher degree of confidence than possible for other protein modules. This should 
pave the way for designing ligands selective for any of the multiple subtypes in any of 
these receptor families. The AChBP-like structures can be used for computerized 
docking to homology models which leads to the a priori discovery of novel ligands 
before laboratory experiments begin to optimize the drug candidates. 
Thus, the present invention also relates to a method of using the crystal of the 
invention in a drug screening assays, such as comprising: 
(a) selecting a potential ligand by performing structure assisted drug 

design with the three-dimensional structure determined for the crystal, 
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wherein said selecting is performed in conjunction with computer 
modeling; optionally 
(b) contacting the potential ligand with the ligand binding domain of 
the ligand-gated receptor in an in vitro or in vivo assay; and 
5 (c) detecting the binding of the potential ligand for the ligand binding 
domain. 

The use of macromolecular crystallography as a tool for investigating drug and 
receptor interactions, in particular structure-based drug design is reviewed in Oakley 

10 and Wilce, Clin. Exp. Pharmacol. Physiol. 27 (2000), 145-151. The desired drug 
could be an inhibitor or an agonist that mimics endogenous transmitters or ligands. 
Once the 3-D structure of the relevant target is known, computational processes can 
be used to search databases of compounds to identify ones that may interact 
strongly with the target Lead compounds can be improved using the 3-D structure of 

15 the complex of the lead compound and its biological target. The activity of the 
selected compound can then be tested in a functional assay such as one of those 
described herein.. 

Preferably, the potential drug is selected on the basis of its having a greater affinity 
for the ligand binding domain of the ligand-gated receptor than that of a standard 

20 ligand for the ligand binding domain of the ligand-gated receptor. However, the 
affinity of the selected compound may also be less than that of a standard ligand. 
Such compounds are useful for example as a lead for the development of further 
analogues which in turn may have enhanced binding affinity or otherwise beneficial 
therapeutic properties. On the other hand, the selected compound may bind to a site 

25 of the ligand-gated receptor other than known ligands. In a preferred embodiment, 
the ligand-gated receptor is a nicotinic acetylcholine receptor. 

In a further embodiment, the method of the present invention further comprises: 
(d) forming a supplemental crystal of a protein-ligand complex by co- 
30 crystallization or soaking the crystal of the water-soluble ligand-binding 

protein with a potential drug, wherein the crystal effectively diffracts X-rays for 
the determination of the atomic coordinates of the protein-ligand complex to a 
resolution of greater than 5.0, preferably greater than 4.0 Angstroms, more 
preferably greater than 3; 
35 (e) determining the three-dimensional structure of the supplemental crystal; 
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(f) selecting a candidate drug by performing a structure assisted drug 
design with the three-dimensional structure determined for the 
supplemental crystal, wherein said selecting is performed in 
conjunction with computer modeling; optionally 
5 (g) contacting the candidate drug with a cell that expresses the ligand- 
gated receptor; and 

(h) detecting a cell response; wherein a candidate drug is identified as a drug 
when the cell response is altered compared to a cell that has not been 
contacted with the candidate compound. 
1 0 The above described methods can further comprise an initial step that precedes step 
(a) wherein said initial step consists of determining the three-dimensional structure of 
a crystal comprising a protein-ligand complex formed between the water-soluble 
ligand-binding protein, and the ligand of the ligand-gated receptor, wherein the 
. crystal effectively diffracts X-rays for the determination of the atomic coordinates of 
15 the protein-ligand complex to a resolution of greater than 5.0, preferably greater than 
4.0 Angstroms. Preferably, the resolution of crystal diffraction in the above described 
methods is at least 3.0, most preferably at least about 2.7 Angstroms. 

In a still further embodiment, the present invention relates to a method of growing a 
20 crystal of a protein-ligand complex comprising: 

(a) contacting the water-soluble ligand-binding protein described above with a 
ligand of a ligand-gated receptor, wherein the water-soluble ligand-binding 
protein forms a protein-ligand complex with the ligand; and 

(b) growing the crystal of the protein-ligand complex; wherein the 
25 crystal effectively diffracts X-rays for the determination of the 

atomic coordinates of the protein-ligand complex to a resolution 

of greater than 5.0, preferably greater than 4.0 Angstroms, more preferably at 

least 3.0, most preferably at least about 2.7 Angstroms. 



30 The crystals of the present invention can also be used in X-ray crystallography- 
driven screening technique that combines the steps of lead identification, structural 
assessment, and optimization such as described for example in Nienaber et al., 
Nature Biotechnol. 18 (2000), 1105 - 1108. This crystallographic screening method 
(named CrystaLEAD) has been used to sample , large compound libraries and 

35 detecting ligands by monitoring changes in the electron density map of the crystal 
relative to the unbound form. The electron density map yields a high- resolution 
picture of the ligand-protein complex that provides key information to a structure- 
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directed drug discovery process. The bound ligand is directly visualized in the 
electron density map. Ligands that bind away from the targeted site may be 
eliminated. 

The above described methods can be coupled with state-of-the-art laboratory data 
5 collection facilities including CCD detectors and data acquisition robotics. 

Further embodiments that may be used in accordance with the ligand-binding 
proteins and receptor of the present invention are described in the prior art, for 
example ligand screening and design by X-ray crystallography is disclosed in 
W099/45379 and W099/45389; WO00/14105 describes assaying a candidate 
1 0 compound for its ability to interact with a modified receptor tyrosine kinase including 
obtaining and applying crystallography coordinates to a computer algorithm for 
generating a model which is applied in an iterative process to various molecular 
structures in order to identify agonist and antagonists of the receptor. All these 
methods may be equally applied to the proteins and crystals of the present invention. 

15 

In one preferred embodiment, the present invention relates to a drug screening assay 
comprising soaking a crystal of the invention in a solution of compounds to be 
screened and detecting the binding of the compound to the ligand-binding protein. A 
possible procedure is also described in Example 9. Besides the detection methods of 
20 ligand-binding mentioned above, in the cited documents and in the examples, the 
detection can also be based on measuring the release of the ligand in the preformed 
crystal of a protein-ligand complex. As described herein before, said ligand preferably 
comprises an alkylated nitrogen and/or quaternary ammonium ion or may be one of 
those described above. 

25 

The structural information on the crystals of the present invention can also be used 
for increasing or decreasing the affinity of a drug to a ligand-gated receptor. Such a 
method can comprise performing structure assisted drug design with the three- 
dimensional structure determined for the crystal, wherein said drug design is 

30 performed in conjunction with computer modeling; and modifying said drug to alter or 
eliminate a portion thereof suspected of interacting with a binding site of the binding 
cavity or with a non-specific binding site of the protein in the crystal. This method 
can, of course, be combined with one or more steps of any of the above described 
screening methods or other screening methods well known in the art. Methods for 

35 clinical compound discovery comprises for example ultrahigh-throughput screening 
(Sundberg, Curr. Opin. Biotechnol. 1 1 (2000), 47-53) for lead identification, and 
structure-based drug design (Verlinde and Hoi, Structure 2 (1994), 577-587) and 
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combinatorial chemistry (Salemme et al., Structure 15 (1997), 319-324) for lead 
optimization. Further information that could be taken into account for drug selection 
and design so far available for the localization of agonist and competitive antagonist 
binding sites on nicotinic acetylcholine receptors have recently been reviewed (Arias, 
5 Neurochem. Int. 36 (2000), 595-6450; Corringer et aL, 1999). Once a drug has been 
selected, the method can have the additional step of repeating the method used to 
perform rational drug design using the modified drug and to assess whether said 
modified drug displays better affinity according to for example interaction/energy 
analysis. 

10 

A related method of the present invention for drug design comprises the step of using 
the structural coordinates of the water-soluble ligand-binding protein crystal 
comprising the coordinates of Table 1 , to computationally evaluate a chemical entity 
for associating with the ligand-binding site or a non-specific binding site of a ligand- 

15 binding protein. This approach, made possible and enabled by this invention, is to 
screen computationally small molecule data bases for chemical entities or 
compounds that can bind in whole, or in part, to the AChBP. In this screening, the 
quality of fit of such entities or compounds to the binding site may be judged either by 
shape complementarity or by estimated interaction energy. Meng, et aL, J, Coma. 

20 Chem. 13 (1992), 505-524. In addition, in accordance with this invention, AChBP 
mutants or chimerics may be crystallized in co-complex with known ligand-gated ion 
channel inhibitors. The crystal structures of a series of such complexes may then be 
solved by molecular replacement (for review see for example Brunger er al. Prog. 
Biophys. Mol. Biol. 72 (1999), 135-155; and references cited therein) and compared 

25 with that of wild-type AChBP. Potential sites for modification within the various 
binding sites of the ligand-binding domain may thus be identified. This information 
provides an additional tool for determining the most efficient binding interactions, for 
example, increased hydrophobic Interactions, between AChPB and a chemical entity 
or compound. 

30 The design of compounds that bind to or inhibit ligand-gated ion channels according 
to this invention generally involves consideration of two factors. 
First, the compound must be capable of physically and structurally associating with 
the ligand-binding domain. Non-covalent molecular interactions important in the 
association of the ligand-binding domain with its ligand include hydrogen bonding, 

35 van der Waals and hydrophobic interactions. 

Second, the compound must be able to assume a conformation that allows it to 
associate with the ligand-binding domain. Although certain portions of the compound 
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will not directly participate in this association, those portions may still influence the 
overall conformation of the molecule. This, in turn, may have a significant Impact on 
potency. Such conformational requirements include the overall three-dimensional 
structure and orientation of the chemical entity or compound in relation to all or a 

5 portion of the binding site or the spacing between functional groups of a compound 
comprising several chemical entities that directly interact with the AChBP. 
If the theoretical structure of the given compound suggests insufficient interaction 
and association between it and AChBP, synthesis and testing of the compound is 
obviated. However, if computer modelling indicates a strong interaction, the molecule 

1 0 may then be synthesized and tested for its ability to bind to AChPB or a ligand-gated 
ion channel and functionally tested according to the methods mentioned above. In 
this manner, synthesis of inoperative compounds may be avoided. Once suitable 
chemical entities or fragments have been selected, they can be assembled into a 
single compound or inhibitor. Assembly jnay be proceed by visual inspection of the 

1 5 relationship of the fragments to each other on the three-dimensional image displayed 
on a computer screen in relation to the structure coordinates of AChBP. This would 
be followed by manual model building using software such as Quanta or Sybyl. 
Useful programs to aid one of skill in the art in connecting the individual chemical 
entities or fragments include CAVEAT (Bartlett, et al, "CAVEAT: A Program to 

20 Facilitate the Structure-Derived Design of Biologically Active Molecules". In Molecular 
Recognition in Chemical and Biological Problems", Special Pub., Royal Chem. Soc. 
78 (1989), 182-196); 3D Database systems such as MACCS-3D (Martin, J. Med. 
Chem. 35 (1992), 2145-2154) and HOOK (Molecular Simulations, Burlington, Mass.). 
Instead of proceeding to build an AChBP ligand in a step-wise fashion one fragment 

25 or chemical entity at a time as described above, AChBP binding compounds may be 
designed as a whole or °de novo" using either an empty active site or optionally 
including some portion(s) of a known ligand(s). These methods include LUDI (Bohm, 
J. ComR. Aid. Molec. Design 6 (1992), 61-78); LEGEND (Nishibata and Itai, 
Tetrahedron 47 (1991), 8985); and LeapFrog (Tripos Associates, St. Louis, Mo.). 

30 Other molecular modelling techniques may also be employed in accordance with this 
invention; see, e.g., Cohen, J. Med. Chem. 33 (1990), 883-894 and Navia and 
Murcko, Current Opinions in Structural Biology 2 (1992), 202-210. 
Such computer modeling is preferably performed with a Docking program (Dunbrack 
et al., Protein Sci. 6 (1997), 1661-1681 and Folding Pes. 2 (1997), R27-R42). 

35 

Methods for the identification of drugs or corresponding lead compounds in 
computational prescreen using X-ray crystal structures are described in the prior art 
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(Verlinde and Hoi, Structure 2 (1994), 577-587; Kuntz, Science 257 (1992), 1078- 
1082; Shuker et al. a Science 274 (1996), 1531-1534; Fejzo et al., Chem. Biol. 6 
(1999), 755-769; WO 98/58961), The structural information can be consulted to 
efficiently optimize leads. Computational programs have been written to identify 
5 compounds ranging from very small molecules or functional groups (GRID: 
Goodford, J. Med. Chem. 28 (1985), 849-857; MCSS: Caflish et a!., J. Med. Chem. 
36 (1993), 2142-2167) to potential lead scaffolds (DOCK: Kuntz et al., Accounts 
Chem. Res. 27 (1994), 117-123) using solved X-ray crystal structures. Another 
method computationally prescreens compound libraries and experimentally tests the 
10 individual "hits" by X-ray crystallography (Verlinde et al., J. Comput. Aided Mol. Des. 
6 (1992), 131-147) in order to decrease the size of the screening library. In addition, 
an experimental approach has been developed to find organic solvents that bind to 
active sites that may be recombined into a lead macromolecule (Allen et al., J. Phys. 
Chem. 100 (1996), 2605-2611). 

15 

Once a compound has been designed or selected by the above methods, the 
efficiency with which that compound may bind to the AChBP or a corresponding 
ligand-binding domain may be tested and optimized by computational evaluation. For 
example, a compound that has been designed or selected to function as an inhibitor 

20 must preferably demonstrate a relatively small difference in energy between its 
bound and free states (i.e., a small deformation energy of binding). Thus, the most 
efficient inhibitors should preferably be designed with a deformation energy of 
binding of not greater than about 10 kcal/mole, preferably, not greater than 7 
kcal/mole. Inhibitors may interact with the ligand-binding domain in more than one 

25 conformation that is* similar in overall binding energy. In those cases, the deformation 
energy of binding is taken to be the difference between the energy of the free 
compound and the average energy of the conformations observed when the inhibitor 
binds to the AChBP. 

30 A compound designed or selected as binding to AChBP may be further 
computationally optimized so that in its bound state it would preferably lack repulsive 
electrostatic interaction with the target ligand-binding domain. Such non- 
complementary (e.g., electrostatic) interactions include repulsive charge-charge, 
dipole-dipole and charge-dipole interactions. Specifically, the sum of all electrostatic 

35 interactions between the ligand and the AChBP when the ligand is bound to AChBP, 
preferably make a neutral or favorable contribution to the enthalpy of binding. 
Specific computer software is available in the art to evaluate compound deformation 
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energy and electrostatic interaction. Examples of programs designed for such uses 
include Gaussian 92, revision C (Frisch, Gaussian, Inc., Pittsburgh, Pa.); AMBER, 
version 4.0 (Kollman, University of California at San Francisco); QUANTA/CHARMM 
(Molecular Simulations, Inc., Burlington, Mass.); and Insight ll/Discover (Biosysm 
5 Technologies Inc., San Diego, Calif.). These programs may be implemented, for 
instance, using a Silicon Graphics workstation, IRIS 4D/35, IBM RISC/6000 
workstation model 550 or better a Unix workstation (SGI, Alpha, Sun, etc.) or any 
Linux PC. Other hardware systems and software packages will be known to those 
skilled in the art. 

10 Once an AChBP-binding compound has been optimally selected or designed, as 
described above, substitutions may then be made in some of its atoms or side 
groups in order to improve or modify its binding properties. Generally, initial 
substitutions are conservative, i.e., the replacement group will have approximately 
the same size, shape, hydrophobicity and charge as the original group. It should, of 

1 5 course, be understood that components known in the art to alter conformation should 
be avoided. Such substituted chemical compounds may then be analyzed for 
efficiency of fit to AChBP by the same computer methods described in detail, above. 
As mentioned before, the above described methods of the present invention can also 
be used as an initial drug screening assay followed by a classical drug screening 

20 assay using the biochemical assays known in the art. 

Methods for the preparation of compounds, chemical derivatives and analogues are 
well known to those skilled in the art and are described in, for example, Beilstein, 
Handbook of Organic Chemistry, Springer edition New York Inc., 175 Fifth Avenue, 
New York, N.Y. 

25 

In one embodiment of the method of the present invention the identified drug 
prevents or promotes correct assembly of a ligand-gated ion channel. Thus, the 
selected drug may for example either interfere with the contact regions of the 
monomers of the ligand-gated ion channel or may act as a scaffold for the assembly. 
30 In the latter case, the drug may be based lor example on an antibody which binds to 
the contact regions of two or more monomers when assembled and thus facilitates 
the assembly process. Preferred contact regions with respect to the AChBP and the 
related nicotinic acetylcholine receptor are given below. In a still further embodiment 
of the above described methods, the drug can be selected such as to bind to a non- 
35 specific binding site of a ligand-gated ion channel. The non-specific binding site can 
for example include those contact regions that are highly conserved between the 
monomers of the ligand-gated ion channels. 
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Once a drug has been selected in accordance with any one of the above described 
methods of the present invention, the drug or a pro-drug thereof can be synthesized 
in a therapeutically effective amount. As used herein, the term "therapeutically 
5 effective amount" means the total amount of the drug or pro-drug that is sufficient to 
show a meaningful patient benefit, i.e., treatment, healing, prevention or amelioration 
of a condition related to an ligand-gated ion channel, or an increase in rate of 
treatment, healing, prevention or amelioration of such conditions. In addition or 
alternatively, in particular with respect to pre-clinlcal testing of the drug the term 
10 "therapeutically effective amount" includes the total amount of the drug or pro-drug 
that is sufficient to elicit a physiological response upon its binding to its target ligand- 
gated ion channel in an non-human animal test. 

The present invention also relates to a drug produced by any one of the above 
15 described methods of the present invention, or a pro-drug thereof. Preferably, the 
drug or pro-drug thereof is present either alone or in a composition in a 
therapeutically effective amount 

The drug obtained by a method of the present invention may be characterized by its 
interaction with the binding sites in the binding cavity defined by the coordinates of 

20 crystal structure of the protein-ligand complex; for examples of such characterization 
see, e.g., US-A-5,798,247. Preferably, the drug, for example a potential inhibitor will 
form non-covalent bonds with one or more amino acids in the active site based upon 
the crystal structure. On the other hand, the drug may bind to a contact region of the 
individual monomers of the pentameric ligand-gated receptor. For example, multimer 

25 contact regions in Lymnaea stagnalis AChBP (SEQ ID No. 2) have been identified. 
Consecutive regions have at least every second residue involved in contacts with the 
other monomer. Contacts have been defined as 2 atoms within 4.2 angstrom 
distance in 2.7 Angstrom structure. The primary contact regions in mature AChBP 
(residues from A contacting B) are 15-21, 44-47, 85-87, 91-94, 122-124,143-146, 

30 1 49, 1 85-1 87 and the complementary contact regions (from B contacting A, (identical 
to residues on A contacting E) are 3-4, 7-8, 11, 37-39, 53, 75-77, 96-104, 114-118, 
163-170; see also Figure 14. 

Thus, in one preferred embodiment the drug of the present invention interacts with a 
ligand-gated receptor comprising a pentamer with monomers A to E, wherein the 
35 drug binds to one or more primary contact regions of a monomer (residues from A 
contacting B) defined by amino acid residues 15 to 21, 44 to 47, 85 to 87, 91 to 94, 
122 to 124, 143 to 146, 149, 185 to 187 of SEQ ID No. 2 and/or to one or more of the 
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complementary contact regions of the other monomer (from B contacting A, (identical 
to residues on A contacting E) defined by amino acid residues 3 to 4, 7 to 8, 1 1 , 37 to 
39, 53, 75 to 77, 96 to 1 04, 1 1 4 to 1 1 8 and 1 63-1 70 of SEQ ID No. 2; or to one of the 
contact regions identified in Figure 14; or to the corresponding contact regions of the 
5 monomers of a ligand-gated ion channel. Preferably, the ligand-gated ion channel is 
the nicotinic acetylcholine receptor and the order of the monomers is ayaSp. 

Any available method may be used to construct such model from the crystaliographic 
and/or amino acid sequence data disclosed herein or obtained from independent 

10 analysis of crystalline AChBP proteins or other water-soluble ligand-binding proteins 
of the present invention. Such a model can be constructed from available analytical 
data points using known software packages such as HKL, MOSFILM, XDS, CCP4, 
SHARP, PHASES, HEAVY, XPLOR, TNT, NMRCOMPASS, NMRPIPE, DIANA, 
NMRDRAW, FELIX, VNMR, MADIGRAS, QUANTA, BUSTER, SOLVE, O, FRODO, 

15 RASMOL, CNS , REFMAC, ARP/WARP, XTALVIEW and CHAIN. The model 
constructed from these data can then be visualized using available systems, 
including, for example, Silicon Graphics, Evans and Sutherland, SUN, Hewlett 
Packard, Apple Macintosh, DEC, IBM, and Compaq. The present invention also 
provides for devices such as a computer system which comprises the model of the 

20 invention and hardware used for construction, processing and/or visualization of the 
model of the invention. Further embodiments provide a computer system comprising 
computer hardware and the model of the present invention. The study of the 
interaction of the candidate species with the model can be performed using available 
software platforms, including QUANTA, RASMOL, O, CHAIN, FRODO, INSIGHT, 

25 DOCK, MCSS/HOOK, CHARMM, LEAPFROG, CAVEAT (UC Berkley), CAVEAT 
(MSI), MODELLER, CATALYST, XTALVIEW and ISIS. Computer readable media 
such as floppy discs, CD ROMs, tapes, and any other storage or processing means 
comprising crystaliographic and/or nucleotide/amino acid sequence data disclosed 
herein or obtained from independent analysis of crystalline AChBP proteins or other 

30 water-soluble ligand-binding proteins of the present invention are subject of the 
present invention as well. Any one of the mentioned means and devices can 
advantageously be used for modeling an antagonist/inhibitor or agonist/activator of a 
ligand-gated receptor. 

Furthermore, the present invention relates, to the construction of theoretical three 
35 dimensional (3D) models of ligand-binding domains of ligand-gated ion channels by 
computer-assisted molecular modeling using the X-ray coordinates of the water- 
soluble ligand-binding proteins of the invention. These 3D models can correspond 
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either to the entire ligand-binding domain (-220 to 240 extracellular amino acids) or 
may be limited to the ligand-binding site. 

The concept of using 3D structures of the mollusc ligand-binding proteins for 
molecular modeling and tool for structure prediction of for example mammalian, in 
5 particular human ligand-gated ion channels gains support from the observation that 
the ligand-binding domain of vertebrate glutamate receptor channels and bacterial 
periplasmic substrate-binding proteins (PBPs) share similar 3D structures despite the 
very low sequence similarity between ionotropic glutamate receptor subunits and the 
PBPs that were used as templates (12%); for review see Paas et at. TiPS 21 (2000), 

1 0 87- 92 and ref ernces cited therein 

Thus, on the basis of a computer-assisted molecular modeling, optionally 
supplemented by for example functional studies of site-specific mutants, the crystal 
structure of the ligand-binding domain of ligand-gated ion channels and theoretical 
3D models of these domains can be predicted. In turn, these models can be used for 

15 structure assisted drug design. The predicted models may be further refined, for 
example by monitoring the effects of mutations of amino acid residues that are 
probably located in the ligand-binding site on (1) agonist-elicited channel activation 
and desensitization, (2) inhibition of channel activity by various competitive receptor 
antagonists; or (3) the binding of various ligands. Experimental setups for analyzing 

20 such effects are known to the person skilled in the art, see also the documents cited 
for functional assay systems of ligand-gated ion channels. 

Thus, the embodiments of the present invention enable various possibilities for 
Identification and modeling new ligands of ligand-gated ion channels as well as 
modifying the ion channels themselves. Accordingly, the present invention relates to 
25 the use of the above described polynucleotides, proteins, dimers and pentamers, 
ligand-gated ion channels, vectors, host cells, antigens, antibodies, oligonucleotide 
probes, crystals, their structural coordinates and methods for screening or profiling 
putative ligands of ligand-gated receptors. 

Methods for the lead generation in drug discovery using proteins and detection 
30 methods such as mass spectrometry (Cheng et al. J. Am. Chem. Soc. 117 (1995), 
8859-8860) and some nuclear magnetic resonance (NMR) methods (Fejzo et al M 
Chem. Biol. 6 (1999), 755-769; Lin et a)., J. Org. Chem. 62 (1997), 8930-8931). 

The newly identified drug obtained by a method of the present invention, i.e. an 
35 antagonist/inhibitor or agonist/activator can be used for the preparation of a 
pharmaceutical composition for the treatment of a ligand-gated ion channel mediated 
or related disorder. Such disorders are well know to the person skilled In the art. For 
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example, possible applications of agonist and antagonists to nAChRs are based on 
their participation in complex functions such as attention, memory, and cognition, and 
their involvement in the pathogenesis of certain neuropsychiatric disorders 
(Alzheimer's and Parkinson's diseases, Tourette's syndrome, schizophrenia, 
5 depression, etc). For the majority of these disorders, the use of nAChRs* agonists 
may represent either a prophylactic (esp. for Alzheimer's and Parkinson's diseases) 
or a symptomatic treatment; for review see for example Mihailescu and Drucker- 
Colin, Arch. Med. Res. 31 (2000), 131-144. 

The medicinal chemistry and molecular biology of GABA-activated ligand-gated ion 
10 channels also in terms of agonist and antagonist structural profiles is described in 
Chebib et al. f J. Med. Chem. 43 (2000), 1427-1447. 

Glycine receptors and disorders of glycinergic neurotransmission are extensively 
reviewed in Rajendra et ai., Pharmacol. Ther. 73 (1997), 121-146 and Barry et al., 
Clin. Exp. Pharmacol. Physiol. 26 (1999), 935-936. 
1 5 The central role of 5-HT3 receptor in CNS disorders and 5-HT3 receptor antagonists 
are described in Bloom and Morales, Neurochemical Research 23 (1998), 653-659 
and Higgins and Kilpatrick, Expert Opin. Invest. Drugs 8 (1999), 2183-2188. 

In one embodiment, the antagonist/inhibitor is or is derived from a protein, an 
20 antigen, antibody or from a toxin of the ligand-gated ion channel. Likewise, the 
agonist/activator can be derived from a protein, an antigen, antibody or from a toxin 
of the ligand-gated ion channel. Possible starting points comprise for example 
peptide toxins, e.g., conotoxin (IMI) and alpha bungarotoxin, lophotoxins 
(Bippinatins), tubocurarine, decamethonium, alpha-cobratoxin, epibatidine, 
25 acetylcholine, choline, nicotine, carbachol, serotonin or GABA. The structure of these 
molecules together with that of the crystal of the target ligand-binding domain can be 
used to model the compound and elucidate side chains, functional groups etc. which 
may be added, deleted or modified in order to improve for example affinity and/or 
specificty of the drug or for example make a drug which acts on a different target 
30 non-reactive with a certain ligand-gated ion channel. 

In a preferred embodiment for the uses according to the present invention, the 
ligand-gated ion channel is the nicotinic acetylcholine receptor and said mediated or 
related disorder is Tourette's syndrome, Alzheimer's disease, addiction to nicotine or 
schizophrenia. 

35 

As mentioned herein before, this is the first time it could be shown that water-soluble 
ligand-binding proteins exists in molluscs, which closely resemble the ligand-binding 
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domain of ligand-gated ion channel of higher mammals. It is expected that similar 
ligand-binding proteins exist in other moiluscan species or even in the lineage the 
Mollusca, Protostomia, Coelomata, Bilateria, Eumetazoa, Metazoa, Fungi/Metazoa 
group. Accordingly, the present invention also relates to the use of a ligand of a 

5 ligand-gated ion channel for identifying and isolating a water-soluble ligand-binding 
protein from such species, preferably from a mollusc. Preferably, the ligand used for 
the isolation of the protein is a-bungarotoxin. The water-soluble ligand binding 
proteins obtainable from these organisms as well as derivatives. that can be made in 
accordance with the teaching present herein are also subject of the present 

10 invention. 



Furthermore, for the first time the crystal structure of a nicotinic binding site has been 
revealed. This crystal structure shows that the moiluscan AChBP is a homolog of the 
LGIC superfamily ligand binding domains. It reveals the Ig-topology, the location of 

15 the binding site at the subunit interface, the position of the MIR and the extensive 
data on the nicotinic ligand binding residues. Importantly, it gives important new 
information about the exact fold and the arrangement of the nicotinic ligand-binding 
site in three dimensions. It shows the presence of a second pocket that has been 
noticed by EM analysis. Furthermore, it clarifies the arrangement of subunits by 

20 showing the relative positioning of the principal and complementary part of the 
ligand-binding site. It provides an explanation of the role of the LGIC superfamily 
conserved residues in stabilizing the monomer structure by the formation of 
hydrophobic cores and packing of secondary structure elements and it makes clear 
how the pentamers are built up, and how weakly the pentamer interfaces are 

25 conserved between LGICs. 

This structure can be used for the numerous drug-design studies that are targeting 
the LGIC superfamily. The general structural knowledge on its folding will be 
applicable to the GABA, serotonin (5HT 3 ) and glycine receptor fields. It will help to 
understand their ligand-binding characteristics and could thus have impact on 

30 development of e.g. ariti-emetics aimed at the 5HT 3 receptor or the mood-defining 
drugs that target the GABA receptors. However, the availability of a three- 
dimensional description of the nicotinic ligand-binding site will be especially relevant 
for the design of new drugs against Alzheimers' disease, epilepsy and the addiction 
to smoking which have the neuronal nicotinic receptors as their targets. 

35 

Many embodiments and the examples feature the acetylcholine-binding protein 
(AChBP) of the invention and the embodiments generally described herein are 
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preferably related to the nicotinic acetylcholine receptor (nAChR), more preferably to 
the alpha subunit, and most preferably to the alpha 7 subunit. However, it should be 
understood that all embodiments equally apply to the other water-solubie ligand- 
binding proteins and generally to the ligand-gated ion channels mentioned herein. 
5 For example, the crystal structure of the AChBP can be used to model new ligands 
for the acetylcholine receptor, preferably such with inhibiting or stimulating action on 
the acetylcholine receptor. Likewise, it is possible to identify and model new ligands 
for other ligand-gated ion channels (including glycine, GABA and serotonin receptor) 
with inhibiting action. Such ligands may for example prevent correct assembly of 
10 ligand gated ion channels. Preferably such ligands prevent correct assembly of 
specific sub types of ligand gated ion channels. On the other hand, ligands can be 
identified and modeled that promote correct assembly of ligand gated ion channels, 
preferably of specific sub types of ligand gated ion channels. As mentioned before, 
the methods of the present invention also allow modeling inhibitors for the non- 
15 specific binding site of ligand gated ion channels. 

In addition, it is possible to predict and create mutants and chimeras of AChBP with 
modified assembly behaviour, modified ligand binding behavior such as with 
i ncreasec l resemblance of the binding site to the acetylcholine receptor subtype on 
the primary binding site and generally with increased resemblance to particular 
20 ligand-gated ion channels in activity and conformational changes. In view of the 
closest relationship between AChBP and the acetylcholine receptor it is particular 
preferred to create mutants and chimeras with increased resemblance of the binding 
site to the acetylcholine receptor subtype on the secondary binding site. However, 
the prediction and creation of mutants and chimeras with increased resemblance of 
25 the binding site to other ligand gated ion channels subtype on the primary binding 
site or on the secondary binding site are envisaged as well. 



These and other embodiments are disclosed and encompassed by the description 
and Examples of the present invention. Further literature concerning any one of the 

30 antibodies, methods, uses and compounds to be employed in accordance with the 1 
present invention may be retrieved from public libraries and databases, using for 
example electronic devices. For example the public database "Medline" may be 
utilized which is available on the Internet, for example under 
http://www.ncbi.nlm.nih.gov/PubMed/medline.html. Further databases and 

35 addresses, such as http://www.ncbi.nlm.nih.gov/, http://www.infobiogen.fr/, 
http://www.fmi.ch/bioIogy/researchJoois.html, http://www.tigr.org/, are known to the 
person skilled in the art and can also be obtained using, e.g., http://www.lycos.com. 
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An overview of patent information in biotechnology and a survey of relevant sources 
of patent information useful for retrospective searching and for current awareness is 
given in Berks, TIBTECH 12 (1994), 352-364. 

5 This disclosure may best be understood in conjunction with the accompanying 
drawings, incorporated herein by references. Furthermore, a better understanding of 
the present invention and of its many advantages will be had from the following 
examples, given by way of illustration and which are not intended as limiting. 

10 Unless stated otherwise in the examples, all recombinant DNA techniques are 
performed according to protocols as described in Sambrook et al. (1989), Molecular 
Cloning : A Laboratory Manual. Cold Spring Harbor Laboratory Press, NY or in 
Volumes 1 and 2 of Ausubel et al. (1994), Current Protocols in Molecular Biology, 
Current Protocols. Standard materials and methods for plant molecular work are 

15 described in Plant Molecular Biology Labfase (1993) by R.D.D. Croy, jointly 
published by BIOS Scientific Publications Ltd (UK) and Blackwell Scientific 
Publications (UK). 

20 Brief description of the drawings 



Figure 1: Clustal X (1.8) multiple sequence alignment of AChB.P amino acid 
sequences. The AChBP alignment was made using "ClustalX_1.8" 
(Thompson et al., Nucleic Acids Research 24 (1997), 4876-4882. The 
subsequent alignment was further processed using "Genedoc" version 
2.5.000 (Nicholas et al. (1997) Genedoc a tool for editing and 
annotating multiple sequence alignments). Identical amino acids are 
indicated with "*", equivalent amino acid with and similar amino 
acids with Glycosylation sites are Asn 66 for L- AChBP and Asn 21 
and 26 for B- AChBP in the amino acid sequence of the respective 
mature AChBP SEQ ID No. 2 and 4, and 6 and 8, respectively. 

Figure 2: Hydrophobicity plots of the mature AChBP amino acid sequences. The 
B&L-AChBP hydrophobicity plots were made using "Protein sequence 
35 analyses" according to the method described in Kyte and Doolite (J. 

Mol. Biol. 157 (1982), 105-132). 2A: L-AChBP_T1 (SEQ ID No. 2), 2B: 
L-AChBP_T2 (SEQ ID No. 4), 2C: B-AChBP_T1 (SEQ ID No. 6), 2D: 
B-AChBP_T2 (SEQ ID No. 8). 
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30 



WO 01/58951 , Pag e 46 of 259 

WO 01/58951 PCT/EP01/01457 

-45 

Figure 3: Clustal X (1.8) multiple sequence alignment of AChBP amino acid 
sequences with the amino acid sequences of the ligand-binding 
domains of the ligand-gated receptors nAChR-a7, GABA A R-pi, 5- 
5 HT3R and GlyR-a1. Sequence alignment and processing was 

performed as described for Figure 1 . The accession numbers of the 
amino acid sequences used for the alignment are as follows: Human 
alpha"! : Human alpha7: Y08420; Human 5HT3: CAA06442; Human 
QlyRjalphal: S12382; Human GABAaJ>1: NPJ300797. A similar 
10 sequence alignment can be performed with the corresponding rat 

sequences (ratnAChRa7_Q05941 , rat5HT3R_P35563, 

ratGABARb1_P15431 , ratGlyRa1_p24524) which will give 
substantially similar if not identical results. 

15 Figure 4: Clustal X (1.8) multiple sequence alignment of AChBP amino acid 
sequences with the amino acid sequences of nAChRs. Sequence 
alignment and processing was performed as described for Figure 1. 
The accession numbers of the amino acid sequences used for the 
alignment are as follows: Human alphal: ACHUA1; Human alpha2: 

20 AAG23253; Human alpha3: A53956; Human alpha4: P43681 ; Human 

alphas: P30532; Human alpha6: Q15825; Human alpha7: Y08420; 
Human alpha9: CAB65091. A similar sequence alignment can be 
performed with the corresponding rat sequences 
(ratnAChRa7_Q05941 , rriAChRa9_P431 44, rnAChR2_P1 238, 

25 rnAChRa3_P04757, rnAChRa4_P09483) which will give substantially 

similar if not identical results. 

Figure 5: Clustal X (1.8) multiple sequence alignment of AChBP amino acid 
sequences with the amino acid sequences of nAChRs alpha 1 and 7. 
30 Sequence alignment and processing was performed as described for 

Figure 1 . The accession numbers of the amino acid sequences used 
for the alignment are as follows: Human alphal: ACHUA1; Human 
alpha7: Y08420. A similar sequence alignment can be performed with 
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the corresponding rat sequence ratnAChRa7_Q05941 which will give 
substantially similar if not identical results. 



5 Figure 6: 



10 



The pentameric structure of AChBP. a In this schematic representation 
each monomer has a different grey level. Subunits are labeled anti- 
clockwise, with A-B, B-C, C-D, D-E and E-A forming the plus and 
minus interface side, with the principal and complementary ligand- 
binding sites respectively (ball-and-stick representation), b Viewing the 
AChBP pentamer perpendicular to the five-fold axis. The equatorially 
located ligand-binding site (ball-and-stick representation) is highlighted 
only in the A (light) and B (dark) interface. 



Figure 7: The AChBP monomer. Ribbon representation of the AChBP monomer. 

15 The secondary structure starting from the N-terminus (top) towards the 

C-terminus (bottom). The monomer is viewed towards the center of 
the pentamer. In the nAChR, the top would correspond to the N- 
terminus of the ligand binding domain, pointing towards the synaptic 
cleft, while the C-terminus would be entering the membrane at the 

20 bottom, continuing into the transmembrane domain. The AChBP 

• monomer is built up mainly of B-strands, except for an N-terminal (- 
helix. It contains 14 B-strands that are organized in the two antiparallel 
B-sheets, with an immunoglobulin topology. However, in contrast to 
the classical immunoglobulin fold, the AChBP B-sheets are rotated 

25 against each other, forming a small pocket, as visible in Figure 6. 



Figure 8: The ligand-binding site at dimer interface. Ribbon representation of 
two neighboring AChBP monomers. Monomer A is shown in grey and 
monomer B in dark grey. The ligand-binding site is located at the 

30 interface between two monomers. As predicted for the nAChRs, the 

acetylcholine binding site in AChBP occurs at the interface between 
two neighboring subunits. Similar to the model proposed for the 
nAChRs, the ligand-binding site is asymmetric, formed mainly by 
aromatic residues. Residues from mature AChBP monomer A 

35 (TyrA89, TrpA143, TyrA185, CysA187, CysA188 and TyrA192) form 

the principal component, while residue TrpB53 from monomer B 
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creates the complementary part of the ligand-binding site. There are 
five identical ligand-binding sites in the AChBP pentamer, similar to 
the homomeric a7 neuronal receptor. 



5 Figure 9: 



10 



The ligand binding site. Stereo figure showing the ligand binding site in 
AChBP, at the interface of two monomers. Residues from mature 
AChBP monomer A (TyrA89, TrpA143, TyrA185, CysA187, CysA188 
and TyrA192) form the principal component, while residue TrpB53 
from monomer B creates the complementary part of the ligand-binding 
site with with additional residues ArgB104, LeuB112 and MetB114. 
There are five identical ligand-binding sites in the AChBP pentamer, 
similar to the homomeric a7 neuronal receptor. 



Figure 10: Multiple sequence alignment of AChBP amino acid sequences with 
1 5 indication of secondary structure and solvent accessibility derived from 

the crystal structure. Alignment of the four molluscan AChBP 
sequences, with secondary structure and solvent accessibility of the 
Lymnea stagnalis AChBP-1 indicated from the crystal structure. The 
Figure was prepared with ESPript (Gouet et al., Bioinformatics. 15 
20 (1999), 305-308), using DSSP (Kabsch and Sander, Biopolymers. 22 

(1983), 2577-2637). Under the alignment the solvent accessibility is 
indicated, white most buried, dark blue most exposed, according to 
ESPript defaults (blue A> 0.4, cyan 0.1 <A<0.4, white A<0.1). 



25 Figure 11: 



30 



35 



Sequence alignment of AChBP with LGICs. The alignment shows only 
the N-terminal domain of the LGIC subunits and is based on a multi- 
sequence alignment of 92 full-length LGIC sequences. Abbreviations 
used, H and Tea, stand for human and Torpedo calif ornica. Secondary 
structure elements (a:ot-helix, p: p-strand, r\: 3io-helix) are indicated 
above the sequence, in accordance with Fig 12a. AChBP shares 23% 
sequence identity with the ligand-binding domain of human ct 7 . The 
LGIC conserved residues (bold, grey background) are displayed. 
Beginning and end of the Cys-loop are indicated by a "* w . Nicotinic 
receptor ligand-binding residues on the principal and complementary 
side are indicated. 
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Figure 12: Overview of the AChBP monomer structure, a Stereo representation of 
the AChBP monomer as viewed from outside the pentameric ring. 
Disulfide bridges are indicated in ball-and-stick representation. In a 
complete ion-channel the N-terminus would be pointing towards the 
5 synaptic cleft, while the C-terminus would enter the membrane at the 

bottom, continuing into the first transmembrane domain, b Topology 
diagram of the AChBP monomer. For comparison with Ig-folds the 
strands have been labeled a-g, showing the additional strand (b') and 
hairpin (f-f). In this structure, strands have been labeled 01-010 with 

10 loops (or turns) L1-L10 preceding each strand with the same number. 

The |35 strand is broken (05-05') with internal loop L5\ |36 also has a 
small break, but is shown continuously; (see Fig. 11). The precise 
beginnings and ends of strands may change slightly with increasing 
resolution, but the topology seen here will be highly conserved across 

1 5 the entire family of LGICs. 



Figure 13: The ligand-binding site, a Stereo representation of the ligand-binding 
site in ball-and-stick representation, showing the contribution of the 
principal A (TyrASQ/chTyrQS), B (TrpA143/ctiTrp149) and C 
20 (TyrA1 85/^Tyrl 90, CysA1 87/ ai Cys1 92, CysA1 88/ciiCysl 93, TyrA1 92 

fa{Tyr\98) and the complementary D (TrpB53/yTrp55, GlnB55/yGlu57) 
, E(ArgB104/yLeu109, ValB106/yTyr111, LeuB112/yTyr117, MetB114 
/yl_eu119) and F (TyrB1 64) 'loops', b Stereo view of the electron 
density map displaying a HEPES buffer molecule in the ligand-binding 
25 site. This experimental density (contoured at 1 a) is derived from 

cross-crystal averaging, c Location of the principal ligand-binding 
residues on the monomer, d Location of the complementary ligand- 
binding residues on the monomer, (orientation as in Figure 6b) 

Dimer interface a Stereo figure of the dimer interface. Representation 
of the interface residues (ball-and-stick) on a schematic secondary 
structure figure. The figure shows the plus face of subunit A and the 
minus minus face of subunit B b Dimer interface interactions. Note that 
due to the low conservation of these interfaces (Fig. 11) the actual 
interactions will not be conserved in any LGIC interface, but that in all 
receptors the topological regions are likely to form the interface. 



30 Figure 14: 
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Figure 15: Conservation in the LGIC superfamily. Conserved residues are 
indicated on the top, middle and bottom respectively on the monomer 
as viewed from the centra! pore. The hydrophilic conserved residues 
are indicated in dark. Conserved residues are indicated as viewed 
from the central pore. Hydrophobic Cluster I: residues 6, 10, 63, 65, 
71, 81, 105, 111; Cluster II: residues 20, 27, 29, 31, 58, 82, 84, 86, 
140, 150, 152, 195; Cluster III: residues 33, 35, 38, 41, 48, 52, 125, 
138, 171, 173, 199, 201. The hydrophilic conserved residues: Asp60, 
Asp85, Asn90, Gly109, Cys123, Cys136, Lys203. Conserved residues 
in the ligand binding site: 106, 145, 192. These three and Lys203 are 
the only conserved residues without structural role in the monomer. 
Note how very few conserved residues are at the surface. Within the 
LGIC family the Cys-loop residues are also highly conserved; see 
bottom, left. 
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EXAMPLES 

EXAMPLE 1: Isolation of Lymnaea AChBP from the CNS, determination of 
5 mass and N-terminal protein sequence 

Isolation : 80 CNS of Lymnaea were homogenized in lysis buffer (PBS [16 mM 
Na2HP04, 4 mM Na^PO^ pH 7.4; 150 mM NaCI] 0.5% Nonidet P-40; 0.1% triton, 
0.2% tween-20) containing 1 ug/ml aprotinin, 10 ug/ml benzamidine, 0.5 ug/ml 
leupeptin, 24 ug/ml pefabloc. The CNS lysate was cleared by triplicate centrifugation 

10 at 12,000x g for 5 min. Streptavidin-coated magnetic beads (Dynal, Oslo), 5 mg, 
were saturated with a-bungarotoxin conjugated to Biotin (4 ug) (Molecular Probes, 
Oxford, UK). These beads were washed in PBS to remove excess a-bungarotoxin, 
then added to the cleared CNS lysate, and incubated for 1 h. After this, beads were 
washed 3 times in PBS to remove unbound protein. A control reaction without a- 

15 bungarotoxin was performed. Proteins bound to a-bungarotoxin were allowed to elute 
off in 1 0 mJ of PBS containing 1 0-4 M nicotine for 1 h. 

Mass determination: The eluent was separated on a microcolumn LC system was 
similar to that described previously (Hsieh et al.; Anal. Chem. 70; 1998; 1847-1852). 
A commercial syringe pump (Perkin-Elmer/ABI, model 140B) was used to deliver a 

20 flow rate of 20 \i\ /min to the column. After loading of sample to the column the flow 
rate was dropped to 10 \x\ /min. The eluent was then switched from 0.2% acetic acid 
to 0.2% acetic acid/ 60% acetonitril in 1 min. Electrospray mass spectra in MS mode 
were acquired on a Micromass Q-TOF quadropole time-of-flight mass spectrometer 
equipped with a Z-spray atmospheric pressure ionization source. 

25 Protein sequence analysis : For sequence analysis a-bungarotoxin binding protein 
was extracted using the same procedure, now followed by SDS-PAGE and Western 
Blotting on PDVF membrane. Sequence analysis was performed with Edman 
degradation of the 24 kDa blotted protein (apparent MW) using a protein sequencer 
(ABI, Perkin Elmer). 

30 

EXAMPLE 2: Cloning the Lymnaea AChBP cDNA sequence: PCR and 
screening of a cDN A library 

PCR cloning : A degenerate oligonucleotide was synthesized based on the amino 
acid sequence LDRADILYNI (SEQ ID No. 10), residues 1-10, of AChBP, (5'- 
35 CGGATCCGA(TC)(AC)GIGC(GATC)GA(TC)AT(ATC)(TC)T(GATC)TA(TC)AA(TC)A 
T-3 1 ; SEQ ID No. 11), containing a BamHI restriction site, and used in combination 
with a primer on the IZAPII lambda vector. PCR was performed on a IZAP II cDNA 
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library of the Lymnaea CNS, in a 100 (xl reaction volume with 1.0 unit of Super Taq 
DNA polymerase (Boehringer Mannheim, Germany) in a DNA thermal cycler (Perkin- 
Elmer Cetus, CT) using 45 cycles of (94 °C, 20 sec; 53 °C, 30 sec; and 72 °C, 1 min. 
Amplified cDNA was digested with SamHI and EcoRI, separated on agarose gel, and 
5 a product of -900 bp was cloned and sequenced. 

Library screening : Approximately 20,000 clones of the amplified lambda ZAP II CNS 

cDNA library were plated at a density of 10 5 pfu/400 cm 2 and absorbed to charged 
Nylon membranes (Boehringer Mannheim, Germany). The AChBP PGR product was 
used as a random primed probe, labeled with [alfa 32 P]dATP (specific activity >10 9 

1 0 cpm/mg). Membranes were hybridized in 6x SSC (1 x SSC; 0.1 5 M NaCI and 0.01 5 M 
Na-citrate) f 0.2% SDS, 5x Denhardts and 10 ug/ml herring sperm DNA at 65 °C for 
18 h. The filters were washed in 0.2x SSC, 0.2% SDS, at 65 °C for 30 min, and 
autoradiographed. Four individual cDNA clones were in vivo excised, and sequenced 
using dideoxy chain termination in both orientations. Two types of sequence were 

15 obtained, named L-AChBP_T1 and L-AChBP_T2. The signal sequences were 
determined with "SMART", Simple Modular Architects Research Tool (V3.1); see 
Schultz et al. a Proc. natl. Acad. Sci. USA 95 (1998), 5857-5864 and Nucleic Acids 
Res. 28 (2000), 231-234. In case of L-AChBP_T1 (SEQ ID No. 2) the prediction 
could experimentally be confirmed. 

20 

EXAMPLE 3: Lymnaea AChBP-related sequences: cloning of the Bulinus 
truncatus cDNAs 

Total RNA was isolated from Bulinus brain ganglia (CNS), and reverse transcribed 
into hexanucleotide primed cDNA. Two degenerate oligonucleotides, directed to the 

25 Lymnaea AChBP JT1 sequence, forward primer: 5'- 
GCGAATTCGAYACIGARWSIGGNGCNACNTG-3 , (SEQ ID No. 12), reverse primer: 
S'-GCGMGCTTCRTCYTCRTAIGCYTCNGCRCARC-S 1 (SEQ ID No. 13), were used 
to amplify AChBP-related sequences. PCR was performed on one animal equivalent 
of CNS cDNA using 150 pmole of each primer under standard conditions for 45 

30 cycles (94'C, 20 sec; 54' C, 30 sec; 72 'C, 1 min). Amplified cDNA was EcoRI/Hindlll 
digested, cloned into EcoRI/Hindlll digested pBluescript, and sequenced. The ORFs 
of the obtained sequences showed a Bulinus AChBP, sequence-related to Lymnaea 
AChBP, named B-AChBP_T1 . This partial cDNA was used to screen a Bulinus brain 
cDNA library using the same hybridization protocol as described for the cloning of the 

35 Lymnaea cDNAs, and yielded two cDNA clones, encoding B-AChBPJH and B- 
AChBP_T2. Sequencing of the cDNAs was performed in both orientations. 
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EXAMPLE 4: The production of L-AChBP-T1 and -T2 and B-AChBP-T1 and -T2 
in the yeast Pichia pastoris and functional characterization 

5 Production of recombinant AChBP: 

In order to produce L-AChBP_T1 and T2 and B-AChBPJT1 and T2 as recombinant 
proteins in the Pichia pastoris expression system (Pichia Expression Kit version 3.0, 
Invitrogen), the DNA sequence encoding the mature form of these proteins (see 
sequence files) was cloned into the pPIC9 expression vector (Invitrogen). The mature 

1 0 sequences of L-AChBP_T1 , T2 and B-AChBP_T1 and T2 were PCR amplified (using 
Pfu-taq DNA polymerase (Stratagene) in order to avoid introduction of errors into the 
sequence due to PCR) and restriction sites were added to the primers to allow rapid 
pPIC9 compatible cloning. The amplified sequence of mature AChBP L-AChBP_T1 
was EcoRl inserted into pPIC9, whereas L-AChBP_T2 and B-AChBP_T1 and T2 

1 5 were Xhol/EcoRI inserted into pPIC9 (the alpha-mating factor cleavage site was fully 
reconstructed after Xhol digestion). 

Constructs with and without an additional C-terminal His-tag (SRGHHHHHH (SEQ ID 
No. 14) in the case of L-AChBP_T1, EFKDDDDKHHHHHH (SEQ ID No. 15) 
otherwise) were generated for each of the AChBP (sub)types. The AGhBP/pPIC9 

20 constructs were amplified in E. coli DHSccF and isolated and purified using the 
plasmid Maxi Kit (Qiagen). Due to the engineered cleavage site at the N-terminus of 
the amino acid sequence four additional amino acids (EAEA, SEQ ID No. 16) will 
precede the N-terminus of the original mature protein. Prior to transfection into Pichia 
pastoris the constructs were linearised (for protocol see supplier's manual; Pichia 

25 Expression Kit version 3.0, Invitrogen) and subsequently purified by 
phenol/chloroform extraction, and ethanol precipitation. Approximately 5 \xg of each 
of the linearised constructs was transformed into freshly prepared electro-competent 
Pichia pastoris cells and plated onto MD plates (for protocol see supplier's manual; 
Pichia Expression Kit version 3.0, Invitrogen corporation). Electrocompetent Pichia 

30 pastoris cells were aquired according to the protocol provided by Invitrogen. Plates 
were incubated at 30°C until the appearance of Pichia colonies, which were 
subsequently analysed for the presence of the correct insert by PCR amplification 
(for protocol see supplier's manual; Pichia Expression Kit version 3.0, Invitrogen). 
Colonies containing an homologous recombination with the Pichia genome, carrying 

35 the AChBP sequence, were grown in 25 ml of BMGY for 1-2 days (30°C; rotation at 
250rpm), after which the cells were centrifuged (10 min., 1500g) and the cell pellet 
was resuspended into 10 ml of BMMY. Growth (30°C, 250rpm) was continued for an 
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additional 4 days (day 3-6), during which the expression of AChBP was induced by 
the addition of 100% methanol (1% of total culture volume) once every 24 hours. At 
day seven the culture was centrifuged (15 min.; 2000g; 4°C) and the medium was 
collected. The AChBP expression level of the various cultures was determined by the 
5 analyses of a fraction of the collected medium with SDS-polyacrylamide gel 
electrophoresis (see suppliers manual; Pichia Expression Kit version 3.0, Invitrogen). 
The cultures that yielded the highest level of AChBP expression were selected and 
stored as glycerol stocks. 

Recombinant AChBP that contained a C-terminal His-tag was isolated and purified 
1 0 from the Picha pastoris medium using Talon metal affinity resin (according to protocol 
as described within the user manual; Clontech laboratories Inc.). The protein 
concentration was. subsequently analysed using SDS-polyacrylamide gel 
electrophoresis and reference marker proteins. Polyclonal antibodies have been 
raised succesfully to the recombinant L-AChBP_T1 and B-AChBP_T1 proteins in 
1 5 Balb-C mice. Immune-sera were obtained without crosslinking of the proteins. 

Binding characteristics of AChBP: 

First the binding curve of a-Bungarotoxin to His-tagged AChBP was determined, and 
an affinity of 3.5 nM was calculated. Using a-Bungarotoxin in a competitive binding 

20 assay ligands of several types of ligand-gated ion channels were then tested on His- 
tagged AChBP, i.e., ACh, serotonin, GABA, glycine, and glutamate. Both ACh and 
serotonin did compete with a-Bungarotoxin binding at 4.2 mM and 269 mM, IC50s 
respectively. GABA, Glycine and glutamate did not compete for binding with a- 
Bungarotoxin. Thus, as predicted by the primary sequence and by subunit structure 

25 also the ligand-binding characteristics of AChBP resembled that of a nAChR. 

In a second series of competitive binding assays the ligand binding characteristics of 
AChBP were studied in more detail, now using various agonists and antagonists of 
the AChRs. Nicotine a classical agonist of the nAChRs, is a high affinity ligand of His- 
tagged AChBP (IC50 98 nM). Epibatidine, a high affinity agonist of the nAChRs, also 

30 binds with high affinity to His-tagged AChBP (IC50 1.4 nM), which is even higher 
than the 58 pM affinity of epibatidine reported for the nAChR (Badio, Mol. Pharmacol. 
45 (1994), 563-569). Other cholinergic agonists bind with a lower affinity e.g., 
decamethonium, carbachol, and choline respectively with IC50s of 4.1 [xM, 43 \iM. 
and 190 pM. Summary of affinities indicated in Table 2. 

35 



WO 01/58951 



Page 55 of 259 



WO 01/58951 PCT/EP01/01457 

-54 

Table 2 





IC50 (jxM) 


nHill 




IC50(|XM) ' 


nHIII 


serotonin 


269 ± 67 


0.65 ± 0.03 


a-cobratoxln 


16.2 ±0.1 


4.08 ± 0.30 


choline 


190±32 


0.91 ± 0.20 


atropine 


5.25 ± 0.49 


1.91 ±0.23 


carbachol 


43 ±2.7 


0.67 ±0.05 


decamethonium 


4.1 ±0.3 


1.13 ±0.09 


acetylcholine 


4.2 ±1.1 


0.72 ±0.09 


physostigmine 


1.25 ±0.04 


0.66 + 0.07 


nicotine 


0.098 ±0.025 


0.78 ±0.05 


d-tubocurarine 


0.093 ±0.003 


0.83 ±0.04 


epibatidlne 


0.0014 ±0.0001 


0.66 ± 0.04 


gallamine 


0.039 ±0.007 


0.71 ±0.14 








a-bungarotoxin 


0.0026 ± 0.0006 


0.80 ±0.18 



5 Competition-binding of typical antagonists of the nAChRs, e.g., tubocurarine and cc- 
Bungarotoxin, have a high affinity for His-tagged AChBP, respectively IC50s of 93 
nM and 2.6 nM. The cholinergic antagonist succinylcholine has a very low affinity for 
His-tagged AChBP (IC50 7.9 mM). Interestingly, also muscarinic receptor 
antagonists bind to His-tagged AChBP with relatively high affinity, e.g., the 

10 muscarinic allosteric modulator gallamine (1C50 39 nM), and the muscarinic 
antagonist atropine (IC50 5.3 mM). Physotigmine which is a known blocker of 
acetylcholinesterase and is also an antagonist of the nAChR, binds to His-tagged 
AChBP with an IC50 of 1 .3 mM. 

Finally, Bipinnatin-B was tested, a synthetic form of the coral lophotoxin on AChBP 
15 (Groebe and Abramson, J. Biol. Chem. 270 (1995), 281-286). Bipinnatin-B is a 
general blocker of nAChRs and is known to covalently bind to TyM 90 of the a 
subunits (Abramson, J. Biol. Chem. 263 (1988), 18568-18573). His-tagged AChBP 
was incubated with the toxin, and the mass of the protein increased with 430.1 Da, 
corresponding well to the calculated mass of Bipinnatin-B of 431 Da, indicating that 
20 the toxin also binds to Tyr-1 84 in His-tagged AChBP. 

EXAMPLE 5: Expression and purification of recombinant AChBP for 
crystallization 

The AChBP_T1 protein from Lymnea stagnalis (AChBP) was overexpressed in 
25 Pichia pastoris GS115 strain using the AOX1 gene expression system from 
Invitrogen. Media and methods used for AChBP expression are also described in 
Invitrogen manual Pichia Expression Kit. For long term storage the transformants 
were grown overnight in YPD medium at30°C. 
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YPD or Yeast Extract Peptone Dextrose medium 

1 % yeast extract (Difco) 

2% peptone (Difco) 

2% dextrose (glucose) (Merck) 

The cells were harvested and suspended in YPD medium containing 15% glycerol at 

final OD600 of -50. The cells were frozen in a dry ice/ethanol bath and stored in the 

freezer (Revco) at -80°C. Normally, the expression of AChBP started with plating the 

cells from the glycerol stock on MD plate. 

MD or Minimal Dextrose Medium 

1 .34% YNB (yeast nitrogen base w/o amino-acids) (Difco) 

4x10-5 % d-biotin (Sigma) 

1 % dextrose 

For plates add 1 5g of agar (Difco) 

The plate was stored in the incubator (Heraeus) for 3-4 days at 30°C. A single colony 

was picked from the plate and inoculated in 150 ml baffled flask (Nalgene) containing 

25 ml of BMGY medium. 

BMGY or Buffered Glycerol-complex Medium 

1 % yeast extract 

2% peptone 

100 mM potassium phosphate (pH 6.0) (Merck) 
1.34% YNB 
4x1'0-5 % d-biotin 
1% glycerol (Merck) 

The culture was placed into the shaker (New Brunswick) and left to grow overnight 
rotating at 250 rpm at 30°C. The following day 12.5 ml of the culture was inoculated 
Into 225 ml of BMGY medium in a 1000 ml baffled flask. In order to increase the yield 
of expressed AChBP a larger number of flasks were used, usually 16. The flasks 
were placed in the shaker and start-cultures were rotated at 250 rpm at 30°C. After 
two days the start-cultures were centrifuged for 15 min at 2500 rpm (Sorvall RC3B+, 
rotor H-6000A) at room temperature. In order to increase the cell mass for bigger 
protein production, cell pellets of two start-culture flasks were pooled together and 
resuspended in 200 ml of BMMY medium containing 1% (w/v) casamino acids. 
BMMY of Buffered Methanol-complex Mediums 1% casamino acids 
1% yeast extract 
2% peptone 

100 mM potassium phosphate (pH 6.0) 
1.34% YNB 
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4x10-5 % d-biotin 
0.5% methanol (Merck) 
1% casamino acids (Difco) 

The cultures were put back into the shaker (250 rpm, 30°C) and induced for the 
5 following 4 days. The concentration of methanol in the medium was kept constant by 
adding 1% (v/v) methanol to the cultures every 24 hours. After 4 days 100 ml of 
culture was harvested and the original volume of 200 ml was readjusted by adding 
fresh BMMY medium with 1% casamino acids. The remaining cultures were induced 
for another 4 days. The harvested cultures were centrifuged for 15 min at 4000 rpm 
10 (Sorval) RC3B+, rotor H-6000A) and the cell pellet was discarded. The supernatant 
was first filtered through a 0.22 (Jm filter (Millipore) to remove any remaining cells and 
it was concentrated using a Minitan system (Waters/Millipore) with 30kDa cutoff filter 
(Waters/Millipore). Both the filtration and concentration and were performed at 4°C. 
Finally, cent rifugat ion at 16000 rpm was done (Sorvall RC5C, rotor SS-34) in order to 
15 remove any debris left after the first two steps. The final volume of concentrated 
sample was -80 ml and it was dialyzed overnight against 2 x 5 I (20 mM Tris [pH 
8.0], 150 mM NaCI and 0.02% NaN3) using 15kDa cutoff dialysis membrane 

(Spectra/Por) at 4°C. The dialyzed protein solution (-100 ml) was loaded onto an 
anion-exchange column (POROS 50 HQ, Pharmacia, column volume 8 ml). After the 

20 initial wash step of -15 column volumes using loading buffer, a salt gradient of 30 
column volumes was run from 150 mM to 1000 mM NaCI. Both solutions contained 
also 20 mM Tris (pH 8.0) and 0.02% NaN3. The peak of interest eluted at -300 mM 
NaCI (conductivity range 16-24 mS/cm). The presence of AChBP was checked by 
Bio-Rad Protein Assay (Bio-Rad) and SDS-PAGE and the fractions of interested 

25 were pooled and concentrated using a Centriprep with a 30kDa cutoff membrane 
(Amicon). The concentrated sample (volume of 5 ml) was loaded onto a gel filtration 
column (Superdex 200 HR 16/60, Pharmacia, column volume 120 ml) using 20mM 
Tris (pH 8.0), 150 mM NaCI and 0.02% NaN3. The protein eluted starting from 60 to 
71 ml with peak at -66 ml. The final purification step of the protein was done on an 

30 anion-exchange column (MonoQ HR10/10, Pharmacia, column volume 6 ml). The 
protein was loaded onto the column in the same buffer as eluted from the gel filtration 
column. The salt gradient used for the column was the identical to the one used for 
the POROS 50 HQ column. The fractions in the conductivity range 25-27.5 mS/cm 
were pooled together and dialyzed against buffer containing 50 mM HEPES (pH 7.0) 

35 and 0.02% NaN3. The protein was concentrated up to -20 mg/ml using a Centricon 
with a 30kDa cutoff membrane (Amicon). The total yield was about 2 mg purified 
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protein per liter of expressed medium. The concentrated protein was stored at 4°C 
and used for crystallization experiments and biochemical characterization. N-terminal 
sequencing revealed the presence of EAEAYVEF residues that are part of the plC9- 
encoded signal sequence, before residue 2. The experimental mass was determined 
5 to be 26544 Da (MALDI), which is ~2kDa more than calculated mass based on 
amino-acid sequence (24649 Da). The difference is assigned to glycosylation of 
AChBP at position Asn66 in the mature sequence, confirmed by deglycosylation 
experiments with N-glycosidase F (Boehringer). 

The purification of the first harvest was done separately from the full harvest. They 
10 were pooled together prior to the last purification step (anion-exchange 
chromatography step on MonoQ column). All above mentioned chromatography 
columns were mounted on an FPLC system (Pharmacia) controlled by the UNICORN 
system (Pharmacia). All solutions used in the FPLC system were prepared with 
MilliQ UF+ water, filtered through 0.22 pm filter (Millipore) and degassed. 

15 

EXAMPLE 6: Crystallization of the AChBP 

All the crystallization experiments were done by vapor diffusion technique in a 
hanging drop mode using 12 well tray (Nelipak) and siliconized cover slides 
(Hampton Research). Trays were placed in a sandwich box (Semadeni) and stored 

20 at 19°C temperature conditioned room. The initial crystallization attempts were 
performed using Hampton Crystal Screen I and II (Hampton Research). Drops 
contained 2 pi of protein (10 mg/ml in 50 mM HEPES [pH 7.0] and 0.02% NaN 3 ) and 
2 pi of reservoir solution. From the first screen it became clear that AChBP makes 
crystalline precipitate in the presence of CaCI 2 salt. A more detailed screen was 

25 made which produced crystals suitable for X-ray analysis. The AChBP crystals 
appeared in the following conditions: 9-11% (w/v) PEG 4000 (Hampton Research), 
100 mM HEPES (pH 7.0), 50-200 mM CaCI 2 x 6H 2 0 and 0.02% NaN 3 or or PEG 
MME 550 10-18% in the same conditions, with 0.3 mM ZnAcetate as additive. 
Depending on the batch of the protein used and the CaCI 2 concentration three 

30 different crystal forms were found: orthorhombic, tetragonal and monoclinic. Both 
orthorhombic and monoclinic crystal forms are frequently twinned. Orthorhombic rod- 
like crystals appeared immediately upon setting up the crystallization experiments (in 
between first few hours) under high [CaCy. The size of the crystals varied from 
0.05x0.05x0.15 to 0.25x0.25x1.0 mm. The crystals diffract X-ray up to 3 A resolution 

35 and show high degree of mosaicity (-05-1.2°). They have the symmetry of space 
group P2 A 2& with cell constants of a= 120.62A, b=137.0lA, c=161.54A with 2 
pentamer molecules per asymmetric unit. Tetragonal crystals, squared in shape, 
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grew at lower CaCI 2 concentration, reaching 0.2x0.3x0.35 mm in size. The maximal 
resolution obtained was 2.7 A with a lower mosaicity (0.5°). They belong to space 
group P4 2 2i2 space group with cell dimensions of a=b=141 .66A, c=120.83A with one 
pentamer molecule per asymmetric unit. The exact crystallization condition for the 

5 tetragonal crystal which was used for refinement of the crystal structure: 1 1.5% (w/v) 
PEG 4000, 100 mM HEPES (pH 7.0), 150 mM CaCI 2 and 0.02% NaN 3 . The third 
crystal form, monoclinic P2 U is very similar in morphology to the orthorhombic 
crystals with cell dimensions of a=121.lA, b=162.lA, c=139.4A, p=90.13°, containing 
4 pentamers per asymmetric unit. This crystals were gave lower resolution data (~ 

10 3.3A resolution). All three crystal forms were used in the structure determination of 
AChBP. 

The resolution limit of diffraction depended very much on the size of the crystals. And 
the largest crystals diffracted weakly to ~4A resolution when exposed to a 
conventional rotating anode X-ray source. Therefore, the use of synchrotron radiation 

15 was critical for the structure determination. The crystals had to be cryo-protected in 
order to slow down the damage caused by high intensity synchrotron radiation. The 
cryo-protection of the AChBP crystal was done in multiple steps. The first steps 
included the stabilization of crystal by adding the 2 pi of mother liquor (equilibrated 
reservoir solution) to the drop with the crystal. After 5 minutes 3 pi of stabilizing 

20 solution was added to the drop. Normally, the stabilizing solution contained slightly 
higher concentrations (1-5%) of the components of the original crystallization buffer. 
As protectant glycerol (Merck) was added, increasing the concentration stepwise 
from 0% to 30% (v/v). For example, the starting solution contained 15% PEG 4000, 
100 mM HEPES (pH 7.0), 150 mM CaCI 2 and 0.02% NaN 3 and the final solution 

25 contained 30% (v/v) glycerol in addition to the components just mentioned. The 
AChBP crystals do not tolerate drastic increase in the glycerol concentrations 
therefore a gentle but more time consuming approach has to be adopted. The 
solution around the crystal has to be stepwise exchanged (usually 5% increase of 
glycerol concentrations) allowing crystals to equilibrate for at least 5 minutes in each 

30 glycerol concentration. Once the crystals were equilibrated in stabilizing solution with 
30% glycerol they were flash-cooled in liquid nitrogen or in the cryo-stream. In all 
three space groups AChBP forms a decamer structure with perfect 52 symmetry, 
where two pentamers contact each other through a calcium-binding site, at the top 1 
of the ct1 helix. This binding site (Asp2 and Asp5 from two monomers) is not 

35 conserved in the LGIC family. In the tetragonal space group the 2-fold of the 
decamer coincides with a crystallographic two-fold, which leads to 
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pseudocentrosymmetric behavior of the phases at low resolution. In solution the 
AChBP protein acts as pentamer. 

Those of skill in the art will appreciate that the aforesaid crystallization conditions can . 

5 be varied. Such variations may be used alone or in combination, and include final 
protein (optionally in complex with a ligand) concentrations between 1 mg/ml and 30 
mg/ml; all combinations of AChPB/ligand to precipitant ratios; use of citrate 
concentrations between 0 mM and 200 mM; DTT concentrations between 0 mM and 
10 mM; and any concentration of beta-mercaptoethanol; pH ranges between 5.5 and 

10 9.5; PEG concentrations between 5% and 25% (w/v); PEG weights between 2000 
and 8000; HEPES concentrations between 5 and 500 mM; use of TRIS or other 
solutions instead of HEPES, and any concentration or type of detergent; any other 
type of precipitating agent; any other buffer; any temperature between -50 fi C and 30 
S C; and crystallization of AChBP or complexes thereof by batch, liquid bridge, or 

1 5 dialysis method using these conditions or variations thereof. 

EXAMPLE 7: Structure determination 

The crystal structure was determined using the multiwave anomalous dispersion 
(MAD) technique on a Pb derivative, but non-crystallographic symmetry (NCS) 

20 averaging was necessary to obtain interpretable electron density. Collection of native 
data and heavy-atom derivatives were carried out at the synchrotron beam-lines in 
Grenoble (ESRF/BM14 and ID14) and Hamburg (DESY/BW7A, BW7B and X1 1). The 
AChBP orthorhombic crystal was soaked in stabilizing solution containing 5 mM 
trimethylleadacetate (MePb) for 5 days. Data sets were collected at four different 

25 wavelengths (0.9492A, 0.861 OA, 0.9507A and 0.9499A) and data were integrated 
and reduced using DENZO/SCALEPACK (Otwinowski and Minor (1997) Processing 
of X-ray diffraction data collected in oscillation mode. In Methods in Enzymology, 
Volume 276: Macromolecular Crystallography, part A. C.W. Carter and R.M. Sweet, 
eds. (New York: Academic Press), pp. 307-326). The program SOLVE (Terwilliger 

30 (1997) SOLVE: An automated structure solution for MAD and MIR. Edition 1.16) 
found 5 Pb sites which were situated on the interface between two pentamers. The 
Pb parameters were refined and phases calculated with SHARP (La Fortelle et al. 
(1997) Advances in MIR and MAD phasing: Maximum-likelihood refinement in a 
graphical environment, with SHARP. Proceedings of the CCP4 study weekend). 

35 Mean figure of merit (FOM) value for 4 wavelengths was 0.45. Search and 
optimization of 5-fold NCS operators were done using programs NCS6D and IMP 
(Kleywegt and Jones (1999) Software for handling macromolecular envelopes. Acta 
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Crystallo., D55, 941-944). 10-fold averaging using refined NCS operators in 
conjunction with density modification by DM (Cowtan (1994) DM: An automated 
procedure for phase improvement by density modification. In Joint CCP4 and ESF- 
EACBM Newsletter on Protein Crystallography 31, 34-38) yielded an interpretable 

5 electron density map. However, some parts of the pentamers were still not clearly 
defined. Therefore, a second MAD experiment was performed on the monoclinic 
crystals soaked in 10 mM MePb for 5 days. Data were collected for only two 
wavelengths, at the Pb peak (0.9479A) and remote (0.9498A) wavelength. The 
processing of the two collected data sets was done with MOSFLM (Leslie (1992) 

10 Recent changes to the MOSFLM package for processing film and image plate data. 
In Joint CCP4 and ESF-EACBM Newsletter on Protein Crystallography, Number 26) 
and data were scaled with SCALA (CCP4. The CCP4 suite: programs for protein 
crystallography. Acta Crystallog. D50, 760-763). 10 Pb sites were identified with 
Solve. The Pb parameters were refined and phases calculated with SHARP in single 

15 anomalous dispersion (SAD) mode using data collected at the Pb absorption peak 
The NCS operators needed for 20-fold averaging were found by NCS6D and 
improved with IMP. 20-fold averaging and density modification by program DM 
further improved electron density. The initial model tracing and sequence assignment 
were done based on the 20-fold averaged electron density with program O (Jones et 

20 al., 1991). However, parts of the molecules were not clearly defined. The electron 
density was further improved doing multi-crystal averaging with DMMULTI (Cowtan, 
1994) using amplitudes of tetragonal, orthorhombic and native data sets and 
experimental phases of the orthorhombic and monoclinic MAD experiments. Initially 
missing parts became clearly defined and a complete model could be built. The initial 

25 atomic model was refined with the program CNS (Brunger et al. (1998) Acta 
Crystallogr. D 54, 905-921) against a maximum-likelihood target without experimental 
phases contribution using tetragonal native data which extend to 2.7A resolution. 
Refinement included five-fold NCS restraints, an overall anisotropic B factor and bulk 
solvent correction. The five-fold NCS restraints were released for the parts of the 

30 pentamer that clearly do not follow the five-fold symmetry. The current model 
contains one pentamer of AChBP consisting of 1035 residues, 14 well-ordered 
solvent molecules, 5 Ca 2+ ions, 5 CI" ions and 5 Hepes molecules, well-ordered 
solvent molecules and 5 HEPES molecules. The following residues are not well 
defined in the electron density: -8-0 (part of a-mating S. cerevisiae signal sequence 

35 not native to AChBP EAEAYVEF; SEQ ID No. 21), 125-135, 155-165, 186-191 and 
206-210. 
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Electron density is detectable in the ligand-binding site of AChBP. It is presumed that 
a HEPES molecule could account for this extra electron density based on its 
chemical properties. HEPES or N-2-Hydroxyethylpiperazine-N'-2-ethanesulfonic acid 
contains a quaternary ammonium ion similar to ligand such as acetylcholine (ACh) 

5 and d-tubocurarine. It has been proposed that the binding of ACh would be mediated 
by cation-it interaction involving N + and u-systems of aromatic residues present in the 
binding site of nicotinic acetylcholine receptor. Without intending to be bound by 
theory it is suggested in accordance with the observation of the present invention that 
the observed HEPES molecule mimics ligand binding analogous to the binding of 

1 0 natural ligands like ACh in the ligand-binding site. 

EXAMPLE 8: More detailed description of the structure determined in Example 7 

As described in the previous example, the crystal structure of AChBP was solved 
using weak Pb MAD data in two crystal forms. The electron density map was 
15 improved substantially by cross-crystal averaging of three crystal forms with 20, 10 
and 5 copies of the monomer in the asymmetric unit respectively (Table 3). 



Table 1: Data collection statistics 

Data set ^ peak X, remote Xainfl. id peak \gjnf\- Native 

20 Space group P2,2& P2\ 

Resol.(A) 3.3/3.4-3.3 3.0/3.1-3.0 2.7/2.8-2.7 

\(A) 0.9492 0.8610 0.9507 0.9479 0.9498 0.943 

Compl.(%) 99.7/99.7 99.6/99.6 99.7/99.7 99.9/99.9 99.5/99.5 97.8/96.5 

MosaicityO 0.62 0.43 0.78 

25 Redundancy 3.7/3.8 3.8/3.9 3.7/3.8 3.5/2.2 3.2/2.0 6.5 

7.7/46.8 7.8/45.2 8.3/55.0 5.9/26.1 6.0/32.9 5.9/67.4 

Vol 8.7/1.6 8.4/1.7 8.3/1.4 7.7/2.7 6.8/1.5 27.4/2.3 
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ISO/ANO ISO/ANO ISO/ANO 



ISO/ANO ISO/ANO 



Rcullis {%) 



0.74/0.89 n.a./0.92 0.54/0.94 



n.ayo.74 0.66/0.77 



Phasing power 0.57/1 .2 n.a./1 .06 2.3/0.91 



n.a./1.1 



0.37/1 .22 



FOM (overall) 0.45 



0.28 



10 



15 



20 



25 



30 



The structure was refined at 2.7 A in space group P4 2 2 1 2, with one AChBP pentamer 
in the asymmetric unit. Thus, native data (Xt 1 ) were collected and the Pb-1 data sets 
(BW7A) at the EMBL/DESY synchrotron in Hamburg and the Pb-2 data sets (BM14) 
at the ESRF, Grenoble (Table 3). Data were processed with DENZO/SCALEPACK 
(Otwinowski & Minor, Methods Enzymol. 276, 307-326 1997) (native) or MOSFLM 
(Leslie, Acta Crystallogr. D. Biol. Crystallogr. 55, 1696-1702, 1999)/SCALA (CCP4) 
(Pb-1, Pb-2). The Pb sites, located at the interface of two pentamers, were found for 
both MAD sets by SOLVE (Terwilliger, Acta Crystallogr. 55, 849-861, 1999) and 
heavy atom parameters were optimized with SHARP (La Fortelle et al., Methods 
Enzymol. 276, 472-494, 1997). NCS operators were found and refined with NCS6D 
and IMP (Kleywegt and Jones, SERC Daresbury Laboratory, Warrington, pp. 59-66, 
1994). DM-muIti (Cowtan, Joint CCP4 and ESF-EACBM Newsletter on Protein 
Crystallography, 31, 34-38, 1994) multi-crystal averaging used amplitudes of 
monoclinic, orthorhombic and native (tetragonal) data sets and experimental phases 
of the orthorhombic and monoclinic MAD experiments. The model was built in O 
(Jones et al., Acta Crystallogr. A47, 110-119, 1991) and refined with the program 
CNS (Brunger et al., Acta Crystallogr. D54, 905-921, 1998), against the tetragonal 
data to 2.7 A resolution. Refinement included partial 5-fold NCS restraints, an overall 
anisotropic B factor and bulk solvent correction. The unusual double cysteine 
Cys187-Cys188 formed a clear disulfide bridge. Because of the limited resolution it 
was refined with standard parameters. The final model contains 1025 residues of 
AChBP pentamer, 5 HEPES molecules, 10 Ca 2+ ions and 15 water molecules. The 
entire AChBP pentamer is well ordered, except for the N-terminal 7 residues (part of 
the signal sequence) and the last five C-terminal residues. In addition, the HEPES, 
the loop region 155-160 and the sugar residues attached to residue Asn66 are not 
well resolved in the electron density. R.m.s deviations from ideal geometry for bond 
distances and angles are 0.01 A and 1.6°, respectively. The sequence alignment was 
calculated by CLUSTALX (Thompson et al., Nucleic. Acids. Res. 25, 4876-4882, 
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1997) and the corresponding figure with Espript (Gouet et al., Bioinfomnatics. 15, 
305-308, 1999). Figures 2-5 were done using programs MOLSCRIPT (Kraulis, P.J., 
J. Appl. Cryst. 24, 946-950, 1991), BOBSCRIPT (Esnouf, Acta Crystallogr. D55, 938- 
940, 1999) and RASTER3D (Merritt and Bacon, Methods Enzymol. 277, 505-524, 
5 1997). Refinement took place with partial five-fold NCS restraints, resulting in an R- 
factor of 26.4% (R fre e = 30%). 

The AChBP pentamer; 

The AChBP homopentamer, when viewed along the five-fold axis, resembles a 
10 windmill toy, with petal-like monomers (Fig. 6a). When viewed perpendicular to the 
five-fold axis it has a disc-like appearance (Fig. 6b). The overall proportions of the 
pentamer are -80x80x62 A, and the diameter of the central hole is -18 A. These 
dimensions are in good agreement with the Torpedo nAChR N-terminal domain EM 
data (Miyazawa et al., J. Mol. Biol. 288, 765-786, 1999). The only subunit contacts in 
15 the AChBP pentamer are dimer interfaces, of which each monomer has two, one 
called the plus side and one called the minus side. We refer to the A (plus)-B (minus) 
interface, as example for the five equivalent interfaces AB, BC, CD, DE and EA (Fig 
6). 

20 The AChBP monomer: 

Each AChBP monomer is a single domain protein, asymmetric in shape, with a size 
of -50x21x27 A (Fig. 12a). It consists of an N-terminal p-helix, two short 3 i0 helices 
and a core of 10 p-strands forming a p-sandwich. The order of p-strands conforms to 
a modified immunoglobulin (Ig) topology (Fig. 12b) with an extra p-hairpin (f'-f") and 

25 an extra strand (b*) (Bork et al., J. Mol. Biol. 242, 309-320, 1994). These additional 
strands introduce two so-called "Greek key" folding motifs. The Ig-based structure 
prediction (Le Novere et al., 1999; Corringer et al., Biophys. J. 76, 2329-2345, 1999) 
agrees well with the AChBP structure, although location of the binding site was 
missed due to the presence of extra p-strands (Fig. 12b). Compared to the classical 

30 Ig-fold, the AChBP p-strands are considerably twisted, with the p-sheets rotated 
against each other, resulting in two separate hydrophobic cores. Thus the three- 
dimensional fold does not resemble other Ig-like proteins and comparison to the 
protein database (Holm and Sander, Nucleic. Acids. Res. 25, 231-234, 1997) did not 
result in a significant match to any known structure. 

35 
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Positioning of functional regions: 

Couple of regions that are important to receptor function can be localized in the 
5 AChBP structure. In muscle type nAChRs the main immunogenic region (MIR), 
comprising residues ^67-^76, acts as an epitope in the autoimmune disease 
myasthenia gravis (Tzartos et al., Mol. Neurobiol. 5, 1-29, 1991). Although the MIR- 
related region in AChBP (residues 65-72) shows no sequence homology to the ar 
subunit, its location in loop L3 at the top of the pentamer in a highly accessible 

10 position agrees well with the expected accessibility for this region. It also fits with EM 
studies that located the MIR at the distal end of the receptor relative to the 
membrane (Beroukhim and Unwin, Neuron 15, 323-331, 1995). 
On each AChBP monomer, a large cavity that is accessible from the central pore of 
the pentameric ring can be seen. The cavity is framed at the entrance by p-strands 

15 03, p4, p5 and p5') (Fig 12a) and is uncharged, mainly hydrophobic, in character. 
This region probably corresponds to the tunnel framed by twisted p-strands that was 
observed in the a r subunit of Torpedo receptor at 4.6 A resolution (Miyazawa et al., 
J. Mol. Biol. 288, 765-786, 1999). However, this cavity is not in contact with another 
large pocket observed at each interface between subunits. These latter pockets are 

20 lined by residues shown to be involved in ligand binding in nAChR (Arias, 
Neurochem. Int. 36, 595-645, 2000; Corringer et al., Annu. Rev. Pharmacol. Toxicol. 
40, 431-458, 2000). They are buried from the solvent, and located close to the 
outside of the pentameric ring. When viewed perpendicular to the five-fold axis they 
are roughly equatorially positioned, -30 A away from the C-termini (Fig. 6b), 

25 conforming to the expected location of the Torpedo receptor ligand-binding site, as 
determined by labeling (Fernando Valenzuela et al., Biophys. J. 66, 674-682, 1994) 
and EM studies (Unwin, J. Mol. Biol. 229, 1101-1124, 1993). 

The ligand-binding site: 

30 Each ligand-binding site is found in a cleft formed by a series of loops from the 
principal face of one subunit and a series of p-strands from the complementary face 
of an adjacent subunit. It is a large cavity buried by a series of loops from the 
principal side and by a p-strands from the complementary side (Fig. 13). The 
principal side on the plus side of the AB interface consists of residues coming from 

35 'loop A' (TyrA89), loop B' (TrpA143, A145) and loop C (TyrA185, the double 
cysteine A187-A188, and TyrA192) (Fig 13c). The complementary part of this binding 



WO 01/58951 



Pag e 66 of 259 



WO 01/58951 PCT7EP01/01457 

-65 

site Is contributed by monomer B and made of 'loop D' (TrpB53, GlnB55), 'loop E 
(ArgB104, ValB106, LeuB112 and MetB114) and 'loop F (TyrB164) (Fig 13d). In this 
pocket four of the aromatic residues form the bottom half of the cavity (TyrA89, 
TyrA185, TyrB164 and TrpB53). The pocket walls are formed by the TyrA192, 
5 TrpA143, main chain of A145 Met B114, the side-chain of GlnB55 and the double 
cysteine (CysA187-CysA188). The hydrophobic parts of ArgB104, ValB106 and 
LeuB1 12 form the top of the pocket (Fig 13a).' 

All residues in the pocket had been successfully identified by photoaffinity labeling 
and mutagenesis studies (Arias, Neurochem. Int. 36, 595-645, 2000; Corringer et al., 

10 Annu. Rev. Pharmacol. Toxicol. 40, 2000). Although the side chain of HisA145 is 
pointing away from the cavity, its main chain is involved. One residue identified by 
labeling studies, TrpA82 (ctiTrp86) (Galzi et al., J. Biol. Chem. 265, 10430-10437, 
1990; Dennis et al., Biochemistry 27, 2346-2357, 1988) is involved in hydrophobic 
core formation and located far from the pocket, thus not participating in ligand 

15 binding. Otherwise, the AChBP ligand-binding site confirms the available biochemical 
and mutational data on nAChR completely. 

The structure, however, shows for the first time how these residues are positioned 
with respect to each other and therefore provide a valuable tool for drug design as 
20 described in the above description of the present invention. 

All observed residues are conserved between known nicotinic ligand-binding 
subunits except the loop F TyrB164 residue. The loop F region has an unusual 
conformation, but since it is relatively weakly resolved, its precise analysis is difficult. 

25. The 'loop F region is stabilized in the structure by a calcium binding site formed by 
AspB161, AspB175 and the main chain of B176. This Ca 2+ ion is structurally 
important for TyrB164 orientation and could therefore be important for proper ligand 
binding. The present findings are supported by labeling studies on muscle/Torpedo 
subunits showing that residues homologous to AspB161, yAsp174/5Asp180 play a 

30 role in ligand binding (Czajkowski et al., Proc. Natl. Acad. Sci. U.S.A. 90, 6285-6289, 
1993; Czajkowski and Karlin, J, Biol. Chem. 270, 3160-3164, 1995; Martin et al., J, 
Biol. Chem. 271, 13497-13503, 1996). Additionally, calcium binding sites that 
enhance the response to agonist binding have been identified in the homologous 
region (residue range 161-172) of neuronal a 7 receptor (Galzi et al., EMBO J. 15, 

35 5824-5832, 1 996). The 'loop F region has low sequence conservation in the nicotinic 
family (Fig. 11) and in other superfamlly members it may well have a different 
conformation, even to the extent of forming a p-strand that connects the two sheets 
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into a p-barrel. Such changes could well lead to variations in affinity, e.g. by changing 
the size of the ligand-binding site or its access route. 



The most likely access routes to the ligand binding sites are from above or below the 
5 double-cysteine-containing 'loop C (Fig. 13a). This region buries the pocket from the 
solvent and therefore prevents access from the outside. Access from the central pore 
has been suggested in the literature (Miyazawa, J. Mol. Biol. 288, 765-786, 1999), 
but this would require major structural rearrangements at the interface, which makes 
it less likely. 

10 

Ligand binding: 

Surprisingly, features of bulky electron density were found that stacked onto Trp143 
in each ligand-binding site in the experimental cross-crystal averaged electron 
density (Fig 13b). Upon consideration we have assigned this to a HEPES (N-2- 

15 Hydroxyethylpiperazine-N'-2-ethanesulfonic acid) buffer molecule, that contains a 
positively charged quaternary ammonium group and therefore has some similarity to 
known nicotinic receptor ligands. Its EC50 is 100 mM, indicating that its binding 
under crystallization conditions (100-150 mM) is possible. Although HEPES molecule 
does not make any specific contacts with the protein, it stacks with its quaternary 

20 ammonium onto Trp143, making cation-it interactions as expected for nicotinic 
agonists (Dougherty, Science 271, 163-168, 1996) (Fig. 13b). However, due to 
limited resolution of the present data and probable low occupancy, the precise 
orientation of the HEPES molecule should be taken with some degree of reservation. 

25 It has been suggested (Changeux and Edelstein, Neuron 21 , 959-980, 1998) that the 
ligand-binding site of nAChRs could be similar to that of acetylcholinesterase (AChE). 
Although the size of the binding site is roughly similar in AChBP and AChE, the 
observed arrangement of aromatic residues is quite different. However, the stacking 
of the quaternary ammonium of HEPES, as far as it has been refined in the current 

30 AChBP structure, is similar to that of the quaternary ammonium of the 
decamezonium in AChE on the Trp84 residue (Harel et al., Proc. Natl. Acad. Sci. 
U.S.A. 90, 9031 -9035, 1 993). 

Subunit arrangement: 

35 From the location of the ligand-binding site conclusions can be drawn about the 
relative arrangement of subunits in the Torpedo and muscle receptors. It has been 
suggested that the cliy and cti8 interfaces occur in a clockwise dyaiSPi arrangement 
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when looking towards the membrane (Machold et al., Eur. J. Biochem. 234, 427-430, 
1995). Such a clockwise arrangement disagrees with the structure determined in 
accordance with the present invention, because the relative arrangement of the 
principal binding site and its complementary partner is anticlockwise when looking 
5 towards the 'bottom' (membrane) side of the pentamer (Fig. 6). 



Pentamer interface: 

The subunit interface consists on the plus side entirely of loop regions (L1 , L2, L4, 

10 L5, L7, L8 and L10), whereas the minus side mostly presents secondary structure 
elements to the interface (al, p1, p2, 03, p5, 06 and L9) (Fig. 14). Several residues 
are important for both ligand-binding and pentamer formation. The interface buries a 
considerably surface area (2700 A 2 ), with a mainly uncharged character including 
only a single bifurcated salt bridge (GluA149-ArgB3 and ArgB104). Most intriguing 

15 about the interface residues is the lack of conservation of these particular residues in 
the entire superfamily, not only with AChBP, but also amongst each other (Fig. 11). 
These changes involve major changes in character, including changes from 
hydrophobic to charged. Even when a residue is conserved in any particular subunit, 
its expected counterpart is missing (either in the same subunit, as in the ct 7 

20 homopentamers, or in contacts such as muscle a-| 5 or y or neuronal (X4P2) with the 
sole exception of the ligand-binding site. The high level of structural conservation 
however, determines involvement of the same topological regions in these contacts 
in all family members (Fig 14b). This indicates that shape complementarity must play 
a major role in determining the conservation of the pentamer structure. It also 

25 indicates that different combinations of subunits will have different interfaces, 
creating variations in the precise allosteric contacts and movements in the various 
subclasses of these ion channels. 

Liaand-qated ion channels: 

30 The lack of conservation of the interface residues seems a general feature in the 
superfamily of LGICs, as the residues that form the interface are among the least 
conserved regions of the domains (Fig. 11). Apparently pentamer formation does not 
impose very stringent evolutionary requirements in this case. However, there is clear 
sequence conservation within the superfamily (Fig. 1 1) and it is interesting to analyze 

35 this in the light of the structure. 



WO 01/58 951 



Pag e 69 of 259 



WO 01/58951 PCT/EP01/01457 

-68 

In the AChBP monomer structure the conserved hydrophobic residues can be 
grouped into three clusters (Fig. 15). In AChBP, as in other proteins with Ig-like fold, 
the packing of the p-sheets is promoted mainly by hydrophobic and to a lesser extent 
by electrostatic interactions. The first cluster is involved in packing of the N-termlnal 
5 helix a1 against the main framework of the monomer and it involves residues 6, 10, 
63, 65, 71 , 81 , 105 and 111. The second cluster, comprising residues 20, 27, 29, 31 , 
58, 82, 84, 86, 140, 150, 152 and 195, is situated in the upper half of the (3-core 
region. The third cluster, including residues 33, 35, 38, 41, 48, 52, 125, 138, 171, 
173, 199 and 201, is located at the lower end of the structure (Fig 15). The only non- 
10 hydrophobic residues that are highly conserved in the superfamily are Asp60, Asp85, 
Asn90 and Gly109. Asp60 and Gly109 are involved stabilizing the turns of a Greek 
key motif connecting strands p3, P5, p6 and p2, where Asp60 stabilizes the N- 
terminus of a small 3 10 helix and Gly109 enables tight turn formation. Conserved 
residues Asp85 and Asn90 are involved in packing of the p-sheets. Asp85 forms 
15 hydrogen bonds to the highly conserved Ser142 and Thr144 and residue Asn90 
brings together the main-chain oxygens of Ser122 and Arg137, enabling disulfide 
bond formation of the nearby absolutely conserved disulfide bond (123-136). This 
disulphide bond is topological^ equivalent to so-called tyr° sine cornerstone' 
(Hemmingsen et al„ Protein Sci. 3, 1927-1937, 1994), which links the two p-sheets 
20 together in Ig-like proteins. This explains why in the Torpedo receptor the Cys128- 
Cys142 bond is important for both preservation of subunit conformational stability 
(Mishina et al., Nature 313, 364-369, 1985) and complete nAChR assembly (Green & 
Wanamaker, J. Neurosci. 18, 5555-5564, 1998). Since the observed overall 
structural conservation is high, it is clear that all LGIC N-terminal domains will have 
25 the same three-dimensional structure. 

In contrast to the above residues, the Cys-loop is a highly conserved hydrophobic 
region in the LGIC family but presents a totally different character in AChBP (Fig. 11). 

In AChBP, this loop is hydrophilic and is found at the bottom (membrane) side of the 
30 protein, at the dimer interface. This location and its hydrophobicity in the LGIC family 
implies that this loop could interact with the membrane or with the transmembrane 
region of the receptors, functions that are absent in AChBP. 

Since all ligand gated ion channels have intrinsically the same function, opening of a 
35 membrane pore, it is likely that the conserved regions of the protein determine this 
function. That also indicates that it is unlikely that the interface of the pentamer has a 
major role in opening the channel. It is possible that the conserved Cys-loop is 
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directly involved in transmitting this kind of information to the membrane part of the 
LGICs. Another option is that large structural changes in the p-sheet regions play a 
role in opening the channel. Indeed, the movement observed at 9 A for Torpedo 
nAChR upon agonist binding (Unwin, Nature 373, 37-43, 1995), fits well with such a 

5 suggestion. In accordance with the present invention a twisted p-sandwich would be 
observed, with two distinct hydrophobic cores and it is entirely possible that these 
cores move with respect to each other upon ligand binding. The effect of such 
movements will then be modulated by the varying subunit interfaces in the different 
subtypes of the receptor, allowing intricate specificity in the neuronal signal 

10 transmission. 

EXAMPLE 9: Ligand-binding crystallization studies 

AChBP was cocrystallized in complex with a-bungarotoxin (aBTX, Sigma). Prior to 
the crystallization experiments the stability of the complex has been investigated. 

15 Using gel-filtration chromatography (Superdex 200 HR 10/30, Pharmacia, column 
volume 24 ml) it has been found that it is possible to purify stable complex between 
AChBP and aBTX. The gel-filtration run was performed using 20 mM Tris (pH 8.0), 
150 mM NaCI and 0.02% NaN 3 . The stability of the complex was also confirmed with 
- native PAGE. The crystallization experiments were done based on the same set of 

20 conditions found to work for AChBP alone; see Example 6. A small screen was set 
up with different precipitant concentrations and various AChBP:ct BTX 
concentrations. Tiny crystals appeared in the conditions containing 10-12% PEG 
4000, 100 mM HEPES (pH 7.0), 20-80 mM CaCI 2 and 0.02% NaN 3 . The best looking 
crystals grew under above mentioned conditions when AChBP: (BTX were mixed in 

25 1:10 molar ratio. In order to check if complex indeed crystallized, crystals were 
thoroughly washed, dissolved in denaturing buffer and checked on SDS-PAGE that 
clearly showed that they contained both proteins. 

In addition, a number of small ligands were bound to AChBP in soaking experiments. 

These include: B-bippinatin (a synthetic analog of lophotoxin), acetylcholine (ACh, 
30 Sigma), d-tubocurarine chloride (Sigma), carbamylcholine chloride (CCh, Sigma), 

galanthamine hydrobromide (Sigma), epibatidine (Sigma) and nicotine (Sigma). The 

soaking solutions were made of stabilizing solutions (see Example 6) and together 

with dissolved ligands (ligands were normally dissolved in 20 mM HEPES [pH 7.0]). 

The ligand concentrations used were dependent on its binding constants, as 
35 determined by ligand-binding studies. The soaking times were different depending on 

the ligand used. After the soaking step the crystals were flash-cooled in liquid 

nitrogen. 
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EXAMPLE 10: Generating human alpha7 nAChR / AChBP chimeras 

The chimeric proteins of nAChR subunits and AChBP can be used as tools in the 
development of novel, nAChR subtype specific ligands. As a first step in developing 

5 these tools chimeric proteins have been designed and constructed in which part(s) of 
the human alpha7 nAChR were grafted into AChBP. Previous studies on the 
molecular determinants of ligand-binding by the alpha7 nAChR have identified three 
amino acid domains that compose the primary part of the ligand-binding site, further 
referred to as "loops A, B, and C". Within each of the three loops amino acid residues 

10 are present that are thought to directly interact with the ligand. Based on sequence 
conservation of the nAChR and AChBP the three possible ligand-binding loops of 
AChBP have been pin-pointed in accordance with the present invention as follows: 
loop A, Trp-101 -> Tyr-108; loop B, Trp-162 -> His-164; loop C, Tyr-204 -> Tyr-211. 
The chimeric proteins that were constructed replace either one (A, B or C) or multiple 

15 (A&B, A&C, B&C and A&B&C) of the ligand-binding loops of AChBP with the 
corresponding human alpha7 nAChR sequence. 

The loop-A domain of AChBP was replaced by the corresponding domain of the 
human alpha7 nAChR using a two-step polymerase chain reaction (PGR). In the first 
step two separate PCR amplifications (35 cycles: 94 S C;30 sec., 58 9 C; 30 sec and 

20 72 2 C; 60 sec.) yielded two halves of the chimera construct. AChBP cDNA (wild type) 
was used as template, and outer primers located either just before the start codon 
(gcgctcgagaaaagagaggctgaagctttggaccgggcagacatctt; SEQ ID No. 17) or just before 
the stop codon (cgcgaattcaagaatttcggagcgtccctt; SEQ ID No. 18) were each used in 
combination with two internal primers gtggaaaccagacattctcctctacaacgccatctcgaaacc 

25 (SEQ ID No. 19) and gaggagaatgtctggtttccacaaagagcttattggcac (SEQ ID No. 20), 
respectively. The internal primers contained a 5-tag-sequence that encoded for the 
introduced alpha7 nAChR domain. As such the two generated chimeric PCR 
products share a common tag containing a part of the alpha7 nAChR subunit In the 
second step, the two PCR products from the first round were pooled and, in the 

30 absence of primers, went through 5 rounds of PCR amplification (94 9 C; 30 sec, 54 
B C; 3 mm. and 72 2 C; 90 sec.). This allowed the two halves of the chimera to anneal 
to each other at the common alpha7 nAChR tag. The subsequent addition of the two 
outer primers and another 35 cycles of 25 PCR amplification (94 S C; 30 sec, 58 9 C; 
30 sec. and 72 9 C; 90 sec) yielded the final chimera construct. All PCR 

35 amplifications were hot-started and performed using PFU DNA-polymerase 
(Invitrogen). The loop-A AChBP/alpha7 chimera was cloned, using Xhol/EcoRI 
restriction sites in the outer primers, into the His-tag containing yeast expression 
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vector pPIC9 (Invitrogen). Validation of the construct was achieved by DNA 
sequencing. Expression of the chimera construct was achieved according to the 
Pichia pastoris protein expression protocol of Invitrogen. 

5 As described in the examples and the description, the present invention provides 
water-soluble ligand-binding proteins derived from molluscs and analogs of ligand- 
gated ion channels, crystals thereof and their use for screening ligands of ligand- 
gated ion channels. In particular, ligand-binding proteins have been identified that are 
capable of forming multimers and are amenable to crystallization. The crystall 

10 structure of one these proteins, an acetylcholine binding protein (AChBP) is provided, 
which can be used to generate 3D models of the extracellular ligand-binding domain 
of ligand-gated ion channels and thus for screening of drugs that act on these ion 
channels. Furthermore, chimeric proteins are provided that are capable of binding a 
ligand of a ligand-gated receptor, and comprising at least the amino acids of the 

15 AChBP determining solubility of the AChBP, in the same positions as in the AChBP, 
and furthermore comprising amino acids determining binding to said ligand. 
It will be clear that the invention may be practiced otherwise than as particularly 
described in the foregoing description and examples. Numerous modifications and 
variations of the present invention are possible in light of the above teachings and, 

20 therefore, are within the scope of the appended claims. 

The entire disclosure of each document cited (including patents, patent applications, 
journal articles, abstracts, laboratory manuals, books, or other disclosures) in the 
Background of the Invention, Description, and Examples is hereby incorporated 
herein by reference. Moreover, the sequence listing is herein incorporated by 

25 reference. 
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43.706 28.669 37.788 1.00 46.40 8 
44.341 27.739 39.753 1.00 46.74 7 
43.406 26.628 39.648 1.00 48.15 6 
44.129 25.330 39.301 1.00 45.72 6 
44.973 25.415 38.074 1.00 44.95 6 
46.240 25.993 38.122 1.00 43.54 -6 
44.495 24.944 36.866 1.00 41.66 6 
47.011 26.099 36.986 1.00 42.05 6 
45.257 25.047 35.733 1.00 41.56 6 
46.521 25.628 35.790 1.00 42.24 6 
42.622 26.376 40.908 1.00 49.43 6 
42.996 26.810 42.001 1.00 49.47 8 

41.524 25.653 40.727 1.00 51.17 7 
40.657 25.250 41.823 1.00 52.06 6 
39.193 25.501 41.477 1.00 52.69 6 
38.354 25.046 42.520 .1.00 51.12 8 
40.896 23.754 41.978 1.00 52.70 6 
40.529 22.966 41.103 1.00 51.13 8 
41.543 23.369 43.070 1.00 52.97 7 
41.820 21.967 43.312 1.00 56.03 6~ 
42.017 21.693 44.803 1.00 59.48 6 
41.882 22.602 45.634 1.00 60.13 8 
43.052 21.555 42.534 1.00 53.78 6 
44.483 22.558 42.968 1.00 56.89 16 
42.342 20.443 45.143 1.00 61.91 7 

42.525 20.084 46.542 1.00 63.81 6 
42.391 18.571 46.749 1.00 65.05 6 
41.828 18.226 48.128 1.00 66.74 6 
42.113 18.973 49.103 1.00 65.03 8 
41.101 17.209 48.234 1.00 67.39 8 
43.857 20.543 47.110 1.00 63.72 6 
44.910 20.036 46.745 1.00 64.11 8 
43.794 21.495 48.031 1.00 64.49 7 
44.981 22.042 48.681 1.00 63.28 6 
44.861 23.578 48.804 1.00 62.29 6 
46.058 24.135 49.539 1.00 61.76 6 
44.735 24.194 47.429 1.00 61.51 6 
45.190 21.449 50.078 1.00 63.42 6 
46.283 21.534 50.632 1.00 63.08 8 
44.141 20.848 50.641 1.00 64.34 7 
44.218 20.252 51.981 1.00 64.43 6 
42.924 19.503 52.302 1.00.63.15 6 
42.723 18.465 51.371 1.00 58.95 8 
45.414 19.306 52.128 1.00 64.40 6 
45.636 18.420 51.299 1.00 62.89 8 
46.188 19.510 53.186 1.00 65.00 7 
47.343 18.676 53.416 1.00 67.55 6 
48.647 19.293 52.939 1.00 71.25 6 
49.725 18.717 53.130 1.00 72.65 8 
48.572 20.463 52.317 1.00 72.58 7 
49.779 21.109 51.830 1.00 73.82 6 
49.505 22.482 51.162 1.00 73.07 6 
.48.855 22.272 49.837 1.00 75.57 6 

48.625 23.359 52.065 1.00 72.32 6 
50.792 21.376 52.912 1.00 75.60 6 
51.984 21.102 52.727 1.00 76.00 8 
50.324 21,907 54.041 1.00 77.55 7 
51.241 22.287 55.107 1.00 79.65 6 
50.507 23.015 56.235 1.00 79.08 6 
51.427 23.954 57.017 1.00 80.06 6 
51.000 25.097 57.338 1.00 80.12 8 
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6 
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ATOM 


1759 


C 


SER 


B 


14 


30 . 343 


36 . 619 


6 .949 


1.00 


59 . 80 


c 

o 


ATOM 


1760 


O 


SER 


B 


14 


31 . 247 


37 .293 


6 .443 


1 . 


00 


60 .23 


8 


ATOM 


1761 


N 


ARG 


B 


15 


29 . 382 


37 .134 


7 .710 


1 . 


00 


58 . 15 


7 


ATOM 


1762 


CA 


ARG 


B 


15 


29.322 


38 . 549 


8 .040 


1 . 


00 


55.80 


6 


ATOM 


17 63 


CB 


ARG 


B 


15 


28 . 271 


39 .239 


7 .183 


1. 


00 


56. 87 


6 


ATOM 


1764 


CG 


ARG 


B 


15 


28.540 


39 . 145 


5 .684 


1. 


00 


60.72 . 


6 


ATOM 


1765 


CD 


ARG 


B 


15 


27.721 


40 . 179 


4 .945 


1. 


00 


63 .19 


6 


ATOM 


1766 


NE 


ARG 


B 


15 


28 . 008 


41.515 


5 .475 


1. 


00 


67 .21 


7 


ATOM 


1767 


CZ 


ARG 


B 


15 


27 .307 


42 . 616 


5 .196 


1. 


00 


67 . 82 


6 


ATOM 


1768 


NH1 


ARG 


B 


15 


26 .259 


42 .549 


4 .384 


1 . 


00 


69 .12 


7 


ATOM 


1769 


NH2 


ARG 


B 


15 


27 .660 


43 .789 


5 .722 


1. 


00 


66 . 84 


7 


ATOM 


1770 


C 


ARG 


B 


15 


28.962 


38 . 655 


9 .511 


1. 


00 


54 . 09 


6 


ATOM 


1771 


O 


ARG 


B 


15 


27.795 


38. 651 


9 .880 


1. 


00 


55.59 


8 


ATOM 


1772 


N 


PRO 


B 


16 


29.979 


38 .747 


10 .377 


1. 


00 


51.42 


7 


ATOM 


1773 


CD 


PRO 


B 


16 


31.405 


38.751 


10 .015 


1. 


00 


48 .17 


6 


ATOM 


1774 


CA 


PRO 


B 


16 


29 . 817 


38.846 


11 .828 


1. 


00 


48 . 94 
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ATOM 


1775 


CB 


PRO 
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16 


31.256 


38 .950 
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1. 


00 


48 .44 
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1776 


CG 


PRO 
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16 


32.038 


38 .262 


11 .284 


1. 


00 
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ATOM 


1777 


C 


PRO 
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16 


28 .974 


40 .014 


12 .300 


1. 


00 


48 .35 
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ATOM 


1778 


O 


PRO 
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16 


28.475 


40 . 006 


13 .420 


1. 


00 


49 .87 
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ATOM 


1779 
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ASP 
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17 


28.825 


41.026 


11.456 


1.00 


49.55 
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1780 


CA 


ASP 
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17 


28.048 


42.201 


11.830 


1. 


00 


51.87 
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ATOM 


1781 


CB 


ASP 
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17 


28 . 638 


43 .469 


11 .204 


1. 


00 


55. 73 


6 


ATOM 


1782 


CG 


ASP 
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17 


29 .956 


43 .896 


11 .856 


1. 


00 


59.93 
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ATOM 


1783 


OD1 


ASP 
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17 


30.158 


43 .598 


13 . 062 


1. 


00 


60 . 57 
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ATOM 


1784 


0D2 


ASP 


B 


17 


30.781 


44.548 


11 .163 


1. 


00 


61 . 01 


8 


ATOM 


1785 


C 


ASP 


B 


17 


26.581 


42.120 


11 .458 


1. 


00 


51.65 


6 


ATOM 


1786 


O 


ASP 


B 


17 


25.837 


43 .066 


11 .684 


1. 


00 


51.12 
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ATOM 


1787 


N 


VAL 


B 


18 


26.156 


40.990 


10 .906 


1. 


00 


53 .24 


7 


ATOM 


1788 


CA 


VAL 


B 


18 


24.772 


40.848 


10 .495 


1. 


00 


53 .38 
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ATOM 


1789 


CB 


VAL 


B 


18 


24.679 


40.682 


8 .978 


1. 


00 


52 .53 
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ATOM 


1790 


CGI 


VAL 
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18 


23 .238 


40 .628 


8 .551 


1. 


00 


53 .33 
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ATOM 


1791 


CG2 


VAL 
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18 


25 .385 


41 . 833 


8.299 


1. 


00 


52 . 08 
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ATOM 


1792 
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VAL 
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18 


24.026 


39 .701 


11.157 


1. 


00 


54 .80 
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1793 
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VAL 
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18 


24.359 


38 .527 


10 .980 
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00 
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ILE 
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ILE 
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1. 
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CB 


ILE 
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1. 
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ILE 
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19 
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40.699 


12 .612 


1. 
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ILE 
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ILE 
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THR 
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10.679 


1. 


,00 
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THR 
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21 
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1811 


CB 


THR 


B 


21 


16.974 
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1, 
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1812 
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THR 
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21 
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CG2 


THR 
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21 
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6 


ATOM 


1815 


O 


THR 
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GLN 
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9.694 
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.00 
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CA 


GLN 
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22 
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35.370 


8.629 
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80.52 
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ATOM 


1818 


CB 


GLN 
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22 


14.492 


33.871 


8.842 
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.00 


81.72 
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ATOM 


1819 


CG 


GLN 
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22 


15.793 


33.097 


8.917 


1 


.00 


83.16 


6 
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ATOM 


1820 


CD 


GLN 


B 


ATOM 


1821 


OE1 


GLN 
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1822 


NE2 


GLN 
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B 


ATOM 


1832 


NH1 


ARG 
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B 


ATOM 


1839 


CG 


ASP 
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VAL 
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CA 


VAL 
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CB 


VAL 
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VAL 
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VAL 
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CA 
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VAL 


B 
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22 15.654 31.832 

22 15.421 31.901 

22 15.787 30.665 

22 13.326 36.091 

22 12.526 35.837 

23 13.075 36.990 
23 11.829 37.769 
23 10.599 36.840 
23 10.348 36.215 
23 10.796 34.734 
23 10.694 34.242 
23 9.590 34.302 
23 8.452 34.832 
23 9.627 33.855 
23. 11.754 38.657 

23 12.776 39.137 

24 10.548 38.879 
24 10.393 39.703 
24 8.975 40.301 
24 8.226 40.339 
24 8.743 40.965 
24 7.113 39.746 
24 10.622 38.798 

24 10.445 39.218 

25 10.994 37.549 
25 11.237 36.584 
25 11.318 35.159 
25 10.001 34.541 
25 10.171 33.024 
25 8.908 32.371 
25 8.747 31.053 
25 9.785 30.235 
25 7.538 30.549 
25 12.527 36.876 

25 13.573 37.105 

26 12.463 36.879 
26 11.233 36.885 
26 13.629 37.134 
26 13.020 37.298 
26 11.627 37.763 
26 14.543 35.917 

26 14.114 34.832 

27 15.801 36.093 
27 16.716 34.969 
27 18.185 35.416 
27 19.127 34.308 
27 18.479 35.737 
27 16.491 34.348 

27 16.517 35.043 

28 16.243 33.045 
28 16.008 32.384 
28 15.095 31.209 
28 17.318 31.938 

28 17.959 31.004 

29 17.696 32.632 
29 18.911 32.353 
29 19.741 33.642 
29 20.986 33.323 
29 20.117 34.272 
29 18.553 31.762 
29 17.731 32.301 



9.753 1.00 85.69 6 

10.976 1.00 86.08 8 

9.103 1.00 85.85 7 

8.717 1.00 81.72 6 

9.632 1.00.80.46 8 

7.765 1.00 84.00 7 

7.744 1.00 86.23 6 

7.695 1.00 87.50 6 

6.314 1.00 89.89 6 

6.215 1.00 90.83 6 

4.833 1.00 93.76 7 

4.068 1.00 95.64 6 

4.534 1.00 96.17 7 

2.811 1.00 95.03 7 

8.987 1.00 86.03 6 

9.476 1.00 85.87 8 

9.497 1.00 86.08 7 

10.693 1.00 85.72 6 

10.799 1.00 89.17 6 

9.453 1.00 90.77 6 

8.486 1.00 92.05 8 

9.376 1.00 89,95 8 

11.891 1.00 83.93 6 

13.045 1.00 83.66 8 

11.613 1.00 81.18 7 

12.677 1. 00 78.99 6 

12.128 1.00 82.30 6 

11.696 1.00 87.15 6 

11.485 1.00 91.09 6 

11.140 1.00 95.87 7 

11.005 1.00 98.43 6 

11.186 1.00 97.96 7 

10.709 1.00 99.56 7 

13.426 1.00 75.18 6 

12.813 1.00 75.11 8 

14.767 1.00 71.32 7 

15.569 1.00 69.54 6 

15.617 1.00 68.20 6 

17.007 1.00 68.04 6 

16.720 1.00 68.94 6 

15.572 1.00 65.66 6 

15.183 1.00 65.23 8 

15.956 1.00 62.13 7 

15.990 1.00 58.35 6 

15.851 1.00 57.74 6 

16.328 1.00 54.93 6 

14.393 1.00 54.94 6 

17.349 1.00 56.29 6 

18.360 1.00 56.31 8 

17.379 1.00 54.44 7 

18.645 1.00 52.98 6 

18.461 1.00 52.15 6 

19.262 1.00 52.73 6 

18.782 1.00 51.86 8 

20.334 1.00 51.10 7 

21.080 1.00 49.08 6 

21.324 1.00 49.72 6 

22.140 1.00 46.43 6 

19.996 1.00 49.12 6 

22.428 1.00 48.65 6 

23.161 1.00.48.71 8 
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ATOM 1881 N SER B 30 

ATOM 1882 CA SER B 3 0 

ATOM 1883 CB SER B 30 

ATOM 1884 OG SER B 30 

ATOM 1885 C SER B 30 

ATOM 1886 O SER B 30 

ATOM 1887 N VAL B 31 

ATOM 1888 CA VAL B 31 

ATOM 1889 CB VAL B 31 

ATOM 1890 CGI VAL B 31 

ATOM 1891 CG2 VAL B 31 

ATOM 1892 C VAL B 31 

ATOM 1893 O VAL B 31 

ATOM 1894 N SER B 32 

ATOM 1895 CA SER B 32 

ATOM 1896 CB SER B 32 

ATOM 1897 OG SER B 32 

ATOM 1898 C SER B 32 

ATOM 1899 O SER B 32 

ATOM 1900 N LEU B 33 • 

ATOM 1901 CA LEU B 33 

ATOM 1902 CB LEU B 33 

ATOM 1903 CG LEU B 33 

ATOM 1904 CD1 LEU B 33 

ATOM 1905 CD2 LEU B 33 

ATOM 1906 C LEU B 33 

ATOM 1907 O LEU B 33 

ATOM 1908 N LYS B 34 

ATOM 1909 CA LYS B 34 

ATOM 1910 CB LYS B 34 

ATOM 1911 CG LYS B 34 

ATOM 1912 CD LYS B 34 

ATOM 1913 CE LYS B 34 

ATOM 1914 NZ LYS B 34 

ATOM 1915 C LYS B 34 

ATOM 1916 O LYS B 34 

ATOM 1917 N PHE B 35 

ATOM 1918 CA PHE B 35 

ATOM 1919 CB PHE B 35 

ATOM 1920 CG PHE B 35 

ATOM 1921 CD1 PHE B 35 

ATOM 1922 CD 2 PHE B 35 

ATOM 1923 CE1 PHE B 35 

ATOM 1924 CE2 PHE B 35 

ATOM 1925 CZ PHE B 35 

ATOM 1926 C PHE B 35 

ATOM 1927 O PHE B 35 

ATOM 1928 N ILE B 36 

ATOM 1929 CA ILE B 36 

ATOM 1930 CB ILE B 36 

ATOM 1931 CG2 ILE B 36 

ATOM 1932 CGI ILE B 36 

ATOM 1933 CD1 ILE B 36 

ATOM 1934 C ILE B 36 

ATOM 1935 O ILE B 36 

ATOM 1936 N ASN B 37 

ATOM 1937 CA ASN B 37 

ATOM 1938 CB ASN B 37 

ATOM 1939 CG . ASN B 37 

ATOM 1940 ODl ASN B 37 

ATOM 1941 ND2 ASN B 37 
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19.169 30.638 22.746 1.00 50.09 7 
18.925 29.997 24.018 1.00 53.68 6 
18.587 28.521 23.817 1.00 53.36 6 
19.653 27.845 23.180 1.00 57.88 8 
20.201 30.150 24.823 1.00 56.00 6 
21.297 29.933 24.306 1.00 57.56 8 
20.049 30.542 26.084 1.00 57.69 7 
21.175 30.752 26.980 1.00 57.19 6 
21.227 32.198 27.460 1.00 57.32 6 
22.536 32.449 28.185 1.00-58.47 6 
21.044 33.147 26.288 1.00 56.64 6 
21.016 29.878 28.204 1.00 57.98 6 
19.938 29.815 28.787 1.00 59.53 8 
22.101 29.232 28.611 1.00 58.26 7 
22.069 28.356 29.765 1.00 58.30 6 
21.806 26.914 29.298 1.00 60.26 6 
21.881 25.975 30.361 1.00 61.39 8 
23.374 28.414 30.530 1.00 58.22 6 
24.402 27.983 30.024 1.00 62.28 8 
23.340 28.937 31.753 1.00 56.90 7 
24.548 29.002 32.572 1.00 56.09 6 
24.489 30.183 33.541 1.00 54.84 6 
24.257 31.555 32.914 1.00 55.33 6 
24.483 32.647 33.962 1.00 53.97 6 
25.201 31.731 31.737 1.00 56.27 6 
24.725 27.728 33.379 1.00 55.50 6 
23.770 27.220 33.950 1.00 56.54 8 
25.948 27.215 33.413 1.00 55.65 7 
26.270 26.018 34.183 1.00 56.27 6 
26.815 24.905 33.279 1.00 60.64 6 
25.908 24.528 32.102 1.00 66.07 6 
24.552 23.965 32.566 1.00 72.18 6 
23.611 23.670 31.373 1.00 74.46 6 
22.303 23.068 31.799 1.00 74.04 7 
27.365 26.493 35.108 1.00 54.54 6 
28.463 26.811 34.655 1.00 55.93 8 
27.079 26.564 36.401 1.00 52.21 7 
28.086 27.045 37.336 1.00 49.54 6 
27.422 27.491 38.633 1.00 46.23 6 
26.545 28.682 38.450 1.00 47.55 6 
25.230 28.536 38.035 1.00 48.89 6 
27.056 29.968 38.603 1.00 48.56 6 
24.434 29.653 37.771 1.00 47.34 6 
26.269 31.087 38.343 1.00 45.87 6 
24.958 30.927 37.926 1.00 47.81 6 
29.224 26.072 37.595 1.00 49.37 6 
29.020 24.880 37.833 1.00 48.82 8 
30.434 26.609 37.530 1.00 47.17 7 
31.634 25.820 37.706 1.00 45.02 6 
32.641 26.106 36.574 1.00 42.09 6 
33.858 25.230 36.717 1.00 38.48 6 
31.966 25.897 35.224 1.00 41.05 6 
-31.393 24.520 35.028 1.00 41.45 6 
32.284 26.141 39.029 1.00 45.51 6 
32.977 25.304 39.603 1.00 45.83 8 
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22.778 40.536 36.774 1.00 40.90 8 

23.257 40.412 38.948 1.00 42.41 7 

25.554 42.768 36.921 1.00 41.50 6 

26.031 42.618 38.042 1.00 42.85 8 

26.250 42.605 35.798 1.00 43.60 7 

27.669 42.266 35.811 1.00 43.31 6 

28.156 41.933 34.415 1.00 42.36 6 

28.359 43.513 36.336 1.00 44.47 6 

28.048 44.637 35.934 1.00 43.75 8 

29.295 43.299 37.244 1.00 44.99 7 

30.009 44.379 37.895 1.00 45.69 6 

30.052 44.061 39.418 1.00 46.91 6 

31.419 43.514 39.831 1.00 49.50 6 

29.726 45.288 40.232 1.00 48.64 6 

29.920 45.030 41.718 1.00 53.73 6 

31.428 44.532 37.302 1.00 45.86 6 

32.156 45.487 37.611 1.00 45.24 8 

31.804 43.581 36.453 1.00 41.25 7 

33.104 43.578 35.813 1.00 38.58 6 

34.056 42.662 36.568 1.00 35.19 6 

33.682 41.315 36.388 1.00 35.43 8 

32.852 43.015 34.431 1.00 40.88 6 
31.776 42.493 34.174 1.00 39.63 8 
33.815 43.131 33.524 1.00 43.16 7 
33.598 42.557 32.212 1.00 43.98 6 
34.355 43.325 31.127 1.00 46.29 6 
35.769 43.727 31.434 1.00 50.31 6 
36.225 44.764 30.401 1.00 52.39 6 

35.853 44.341 28.978 1.00 52,02 6 
36.333 45.308 27.965 1.00 54.82 7 
33.963 41.075 32.230 1.00 43.71 6 
34.673 40.602 33.114 1.00 44.78 8 
33.443 40.310 31.267 1.00 44.16 7 
32.562 40.750 30.171 1.00 42.37 6 
33.704 38.873 31.184 1.00 39.82 6 
32.836 38.422 30.016 1.00 40.83 6 
31.813 3-9.505 29.881 1.00 42.58 6 
35.141 38.524 30.941 1.00 39.41 6 
^5.772 39.048 30.032 1.00 40.47 8 
35.663 37.637 31.765 1.00 39.61 7 
37.020 37.175 31.582 1.00 39.82 6 
37.765 37.046 32.915 1.00 41.36 6 
39.238 36.644 32.763 1.00 50.17 6 
39.989 36.540 34.094 1.00 55.05 6 
39.506 37.129 35.084 1.00 57.32 8 
41.067 35.888 34.153 1.00 56.17 8 
36.802 35.804 30.966 1.00 39.55 6 
36.537 34.840 31.676 1.00 38.71 8 
36.864 35.736 29.638 1.00 36.87 7 
36.690 34.475 28.938 1.00 35.52 6 
36.457 34.702 27.448 1.00 35.28 6 
36.249 33.378 26.752 1.00 33.50 6 
35.249 35.586 27.255 1.00 31.71 6 
37.935 33.640 29.157 1.00 35.07 6 
39.025 34.005 28.741 1.00 37.13 8 
37.759 32.511 29.823 1.00 35.85 7 
38.866 31.631 30.167 1.00 36.60 6 
38.554 30.913 31.482 1.00 37.34 6 
38.127 31.727 32.701 1.00 39.09 6 
37.534 30.812 33.739 1.00 37.97 6 
39.306 32.469 33.259 1.00 41.28 6 
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51 . 11 


O 


ATOM 


4385 


CG2 


ILE 


C 


140 


31 . 536 


62 . 955 


32 . 134 


1 . 


00 


CO AC 

52 . 05 


b 


ATOM 


4386 


CGI 


ILE 


C 


140 


29 . 930 


63 . 555 


30.393 


1 . 


00, 


A f\ 11 

49 . 11 


c 
0 


ATOM 


4387 


CD1 


ILE 


C 


140 


29.307 


62 .187 


30 . 676 


1 . 


00 


49 . 89 


b 


ATOM 


4388 


C 


ILE 


C 


140 


31 . 624 


65. 861 


29 . 560 


1 . 


00 


47 . 99 


b 


ATOM 


4389 


O 


ILE 


C 


140 


30.515 


66.323 


29.365 


1 . 


00 


50 . 04 


8 


ATOM 


4390 


N 


GLY 


C 


141 


32 .620 


66.025 


28.701 


1 . 


00 


46 . 57 


7 


ATOM 


4391 


CA 


GLY 


C 


141 


32 .414 


66.732 


27 .447 


1 . 


00 


46 . 87 


6 


ATOM 


4392 


C 


GLY 


C 


141 


33 .453 


66.323 


26.416 


1 . 


00 


46 . 66 


b 


ATOM 


4393 


O 


GLY 


C 


141 


34.359 


65 .565 


26 . 739 


1 . 


00 


A C A 1 

46 . 42 


8 


ATOM 


4394 


N 


SER 


C 


142 


33 .329 


66 . 804 


25. 180 


1 . 


00 


45 . 19 


7 


ATOM 


4395 


CA 


SER 


C 


142 


34.303 


66 .474 


24. 140 


1 . 


00 


41 . 98 


6 


ATOM 


4396 


CB 


SER 


C 


142 


33 .974 


67 .165 


22 . 828 


1 . 


00 


A A A/T 

40 . 96 


b 


ATOM 


4397 


OG 


SER 


C 


142 


35 .062 


67 . 057 


21. 943 


1 . 


00 


n A oo 

34 . 78 


o 
O 


ATOM 


4398 


C 


SER 


C 


142 


35 .698 


66.885 


24.551 


1 . 


00 


43 .33 


6 


ATOM 


4399 


O 


SER C 


142 


35 .915 


67 . 956 


25.115 


1 . 


00 


45 . 01 


8 


ATOM 


4400 


N 


TRP 


c 


143 


36 .655 


66.028 


24.256 


1 . 


00 


43 .12 


7 


ATOM 


4401 


CA 


TRP 


C 


143 


38 .025 


66.300 


24. 622 


1 . 


00 


42 . 98 


6 


ATOM 


4402 


CB 


TRP 


C 


143 


38 .768 


64 .982 


24 . 819 


1 . 


00 


41 . 57 


6 


ATOM 


4403 


CG 


TRP 


C 


143 


40.125 


65.141 


25 . 446 


1 . 


00 


39 . 54 


6 


ATOM 


4404 


CD2 


TRP 


C 


143 


40.394 


65 .420 


26. 820 


1 . 


00 


36 . 50 


6 


ATOM 


4405 


CE2 


TRP 


C 


143 


41.795 


65 .481 


26 .967 


1 . 


00 


35.30 


6 


ATOM 


4406 


CE3 


TRP 


C 


143 


39 .584 


65 . 625 


27 . 943 


1 . 


.00 


36 . 55 


6 


ATOM 


4407 


CDl 


TRP 


C 


143 


41.345 


65. 048 


24. 829 


1 , 


00 


38 . 96 


6 


ATOM 


4408 


NEl 


TRP 


c 


143 


42 .353 


65.251 


25.738 


1 . 


, 00 


35.00 


7 


ATOM 


4409 


CZ2 


TRP 


c 


143 


42.400 


65.736 


28.192 


1. 


.00 


34 . 47 


6 


ATOM 


4410 


CZ3 


TRP 


c 


143 


40.185 


65.878 


29. 153 


1 . 


. 00 


35 . 28 


6 


ATOM 


4411 


CH2 


TRP 


c 


143 


41.580 


65.931 


29.271 


1 . 


, 00 


35 .16 


6 


ATOM 


4412 


C 


TRP 


c 


143 


38 .767 


67 .159 


23 . 605 


1 , 


, 00 


43 . 69 


6 


ATOM 


4413 


O 


TRP 


c 


143 


39 .657 


67.915 


23.962 


1 , 


. 00 


46 . 65 


8 


ATOM 


4414 


N 


THR 


c 


144 


38 .402 


67.065 


22.338 


1 , 


, 00 


42 .21 


7 


ATOM 


4415 


CA 


THR 


c 


144 


39.107 


67.834 


21.333 


1 


.00 


40 .90 


6 


ATOM 


4416 


CB 


THR 


c 


144 


39.839 


66.901 


20 .372 


1 . 


, 00 


40 . 06 


6 


ATOM 


4417 


OG1 


THR 


c 


144 


38 .907 


65 .976 


19.798 


1 , 


. 00 


40 . 17 


8 


ATOM 


4418 


CG2 


THR 


c 


144 


40.916 


66 . 144 


21.106 


1 


.00 


38.34 


6 


ATOM 


4419 


C 


THR 


c 


144 


38 .252 


68.795 


20 . 520 


1 


.00 


43 . 03 


6 


ATOM 


4420 


O 


THR 


c 


144 


38 .786 


69.631 


19.795 


1 


.00 


43 . 02 


8 


ATOM 


4421 


N 


HIS 


c 


145 


36.934 


68.687 


20. 635 


1 


. 00 


42 . 41 


7 


ATOM 


4422 


CA 


HIS 


c 


145 


36.065 


69.571 


19.885 


1 


.00 


45 . 42 


6 


ATOM 


4423 


CB 


HIS 


c 


145 


34.994 


68,772 


19.144 


1 


. 00 


48 . 25 


6 


ATOM 


4424 


CG 


HIS 


c 


145 


35.533 


67.873 


18.071 


1 


.00 


49 .32 


6 


ATOM 


4425 


CD2 


HIS 


c 


145 


36.052 


68.154 


16.853 


1 


.00 


47 . 20 


6 


ATOM 


4426 


NDl 


HIS 


c 


145 


35.572 


66.500 


18.197 


1 


.00 


47 .13 


7 


ATOM 


4427 


CEl 


HIS 


c 


145 


36.091 


65.976 


17.103 


1 


.00 


47 . 99 


6 


ATOM 


4428 


NE2 


HIS 


c 


145 


36.391 


66.958 


16.272 


1 


.00 


48 . 34 


7 


ATOM 


4429 


C 


HIS 


c 


145 


35.394 


70.627 


20.754 


1 


.00 


48 .71 


6 


ATOM 


4430 


O 


HIS 


c 


145 


34.738 


70.325 


21.746 


1 


.00 


47 . 60 


8 


ATOM 


4431 


N 


HIS 


c 


146 


35.562 


71.883 


20.363 


1 


.00 


52 .53 


7 


ATOM 


4432 


CA 


HIS. C 


146 


34.972 


72.993 


21.094 


1.00 


53 .12 


6 


ATOM 


4433 


CB 


HIS 


c 


146 


35.777 


74.261 


20.840 


1 


.00 


50 .98 


6 


ATOM 


4434 


CG 


HIS 


c 


146 


•35.931 


74.586 


19.390 


1 


.00 


48 . 89 


6 


ATOM 


4435 


CD2 


HIS 


c 


146 


35.013 


74.755 


18.409 


1 


.00 


47 .83 


6 


ATOM 


4436 


NDl 


HIS 


c 


146 


37.161 


74.776 


18.801 


1 


.00 


48.82 


7 


ATOM 


4437 


CEl 


HIS 


c 


146 


36.993 


75.049 


17.519 


1 


.00 


48.32 


6 


ATOM 


4438 


NE2 


HIS 


c 


146 


35.699 


75.043 


17.257 


1 


.00 


46.12 


7 


ATOM 


4439 


c ; 


HIS 


c 


146 


33.524 


73.188 


20.664 


1 


.00 


54.19 


6 


ATOM 


4440 


O 


HIS 


c 


146 


33.047 


72.531 


19.736 


1 


.00 


54.21 


8 


ATOM 


4441 


N 


SER 


c 


147 


32.847 


74.112 


21.341 


1 


.00 


56.30 


7 


ATOM 


4442 


CA 


SER 


c 


147 


31.437 


74.418 


21.115 


1 


.00 


57.23 


6 
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ATOM 


4443 


CB 


SER C 


147 


ATOM 


4444 


OG 


SER C 


147 


ATOM 


4445 


C 


SER C 


147 


ATOM 


4446 


O 


SER C 


147 


ATOM 


4447 


N 


ARG C 


148 


ATOM 


4448 


CA 


ARG C 


148 


ATOM 


4449 


CB 


ARG C 


148 


ATOM 


4450 


CG 


ARG C 


148 


ATOM 


4451 


CD 


ARG C 


148 


ATOM 


4452 


NE 


ARG C 


148 


ATOM 


4453 


CZ 


ARG C 


148 


ATOM 


4454 


NHl 


ARG C 


148 


ATOM 


4455 


NH2 


ARG C 


148 


ATOM 


4456 


C 


ARG C 


148 


ATOM 


4457 


O 


ARG C 


148 


ATOM 


4458 


N 


GLU C 


149 


ATOM 


4459 


CA 


GLU C 


149 


ATOM 


4460 


CB 


GLU C 


149 


ATOM 


4461 


CG 


GLU C 


149 


ATOM 


4462 


CD 


GLU C 


149 


ATOM 


4463 


OE1 


GLU C 


149 


ATOM 


4464 


OE2 


GLU C 


149 


ATOM 


4465 


C 


GLU C 


149 


ATOM 


4466 


O 


GLU C 


149 


ATOM 


4467 


N 


ILE C 


150 


ATOM 


4468 


CA 


ILE C 


150 


ATOM 


4469 


CB 


ILE C 


150 


ATOM 


4470 


CG2 


ILE C 


150 


ATOM 


4471 


CGI 


ILE C 


150 


ATOM 


4472 


CDl 


ILE C 


150 


ATOM 


4473 


C 


ILE C 


150 


ATOM 


4474 


O 


ILE C 


150 


ATOM 


4475 


N 


SER C 


151 


ATOM 


4476 


CA 


SER C 


151 


ATOM 


4477 


CB 


SER C 


151 


ATOM 


4478 


OG 


SER C 


151 


ATOM 


4479 


C 


SER C 


151 


ATOM 


4480 


O 


SER C 


151 



ATOM 4481 N VAL C 152 

ATOM 4482 CA VAL C 152 

ATOM 4483 CB VAL C 152 

ATOM 4484 CGl VAL C 152 

ATOM 4485 CG2 VAL C 152 

ATOM 4486 C VAL C 152 

ATOM 4487 O VAL C 152- 

ATOM 4488 N ASP C 153 

ATOM 4489 CA ASP C 153 

ATOM 4490 CB ASP C 153 

ATOM 4491 CG ASP C 153 

ATOM 4492 ODl ASP C 153 

ATOM 4493 OD2 ASP C 153 

ATOM 4494 C ASP C 153 

ATOM 4495 O ASP C 153 

ATOM 4496 N PRO C 154 

ATOM 4497 CD PRO C 154 

ATOM 4498 CA PRO C 154 

ATOM 4499 CB PRO C 154 

ATOM 4500 CG PRO C 154 

ATOM 4501 C PRO C 154 

ATOM 4502 O PRO C 154 

ATOM 4503 N THR C 155 
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31.055 75.610 21.978 1.00 57.36 6 

32.017 76.635 21.828 1.00 59.18 8 

30.972 74.660 19.682 1.00 57.72 6 

29.790 74.484 19.375 1.00 57.25 8 

31.885 75.065 18.809 1.00 58.23 7 

31.517 75.336 17.424 1.00 60.12 6 

32.555 76.264 16.777 1.00 63.75 6 

32.799 77.567 17.549 1.00 70.83 6 

33.950 78.393 16.946 1.00 77.31 6 

34.422 79.453 17.852 1.00 84.18 7 

33.696 80.506 18.245 1.0086. 23 6 

32.447 80.661 17.818 1.00 87.64 7 

34.213 81.408 19.072 1.00 86.26 7 

31.390 74.051 16.601 1.00 59.83 6 

30.786 74.053 15.519 1.00 60.30 8 

31.954 72.959 17.121 1.00 57.69 7 

31.937 71.677 16.425 1.00 54.67 6 

33.364 71.132 16.321 1.00 52.47 6 

34.395 72.228 16.050 1.00 52.11 6 

35.824 71.718 15.896 1.00 50.20 6 

36.246 70.852 16.678 1.00 49.43 8 

36.537 72.203 15.004 1.00 47.62 8 

31.043 70.698 17.162 1.00 53.64 6 

30.252 69.985 16.552 1.00 53.16 8 

31.172 70.667 18.479 1.00 52.16 7 

30.353 69.795 19.289 1.00 51.98 6 

31.157 68.612 19.883 1.00 53.49 6 

30.361 67.954 21.019 1.00 52.00 6 

31.450 67.571 18.800 1.00 53.48 6 

32.235 66.383 19.299 1.00 50.44 6 

29.750 70.565 20.446 1.00 53.26 6 

30.410 71.389 21.095 1.00 51.01 8 

28.479 70.280 20.694 1.00 54.99 7 

27.749 70.887 21.797 1.00 56.94 6 

26.693 71.873 21.280 1.00 54.85 6 

25.665 71.221 20.557 1.00 55.92 8 

27.084 69.717 22.516 1.00 59.00 6 

26.536 68.820 21.871 1.00 60.03 8 

27.172 69.705 23.844 1.00 60.32 7 

26.569 68.651 24.655 1.00 61.16 6 

27.564 68.096 25.694 1.00 60.14 6 

28.858 67.723 25.009 1.00 61.98 6 

27.817 69.110 26.775 1.00 61.23 6 

25.373 69.237 25.391 1.00 62.82 6 

25.379 70.409 25.758 1.00 63.73 8 

24.349 68.425 25.614 1.00 65.81 7 

23.147 68.887 26.293 1.00 67.23 6 
22.150 69.377 25.249 1.00 69.19 6 
22.748 70.425 24.320 1.00 72.38 6 
22.786 71.614 24.718 1.00 72.90 8 
23.193 70.060 23.201 1.00 75.01 8 
22.505 67.776 27.120 1.00 69.10 6 
•22.504 66.612 26.717 1.00 68.47 8 
21.970 68.115 28.304 1.00 70.49 7 
22.282 69.309 29.094 1.00 69.97 6 
21.325 67.098 29.147 1.00 73.18 6 

21.148 67.814 30.477 1.00 70.84 6 
22.300 68.745 30.493 1.00 71.35 6 
19.985 66.669 28.522 1.00 77.25 6 
.19.591 67.183 27.462 1.00 77.76 8 
19.279 65.744 29.175 1.00 80.57 7 
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ATOM 


4504 


CA 


THR C 155 


18.010 


65.253 


28. 


633 


1.00 83.84 


6 


ATOM 


4505 


CB 


THR C 155 


18.244 


63.932 


27. 


837 


1.00 83.53 


6 


ATOM 


4506 


OG1 


THR C 155 


18.609 


62 .882 


28. 


744 


1.00 83.26 


8 


ATOM 


4507 


CG2 


THR C 155 


19.361 


64.105 


26. 


823 


1.00 83.08 


6 


ATOM 


4508 


C 


THR C 155 


16.897 


65.001 


29. 


678 


1.00 87.74 


6 


ATOM 


4509 


O 


THR C 155 


16.826 


65.677 


30. 


715 


1.00 87.94 


8 


ATOM 


4510 


N 


THR C 156 


16.042 


64.012 


29. 


370 


1.00 91.72 


7 


ATOM 


4511 


CA 


THR C 156 


14.882 


63 .577 


30. 


180 


1.00 93.73 


6 


ATOM 


4512 


CB 


THR C 156 


14.501 


62.092 


29. 


877 


1.00 93.94 


6 


ATOM 


4513 


OG1 


THR C 156 


14.249 


61.933 


28. 


470 


1.00 93.93 


8 


ATOM 


4514 


CG2 


THR C 156 


13.253 


61.681 


30. 


696 


1.00 93.29 


6 


ATOM 


4515 


C 


THR C 156 


15.042 


63.695 


31. 


693 


1.00 95.32- 


6 


ATOM 


4516 


O 


THR C 156 


15.626 


62.817 


32. 


347 


1.00 95.61 


8 


ATOM 


4517 


N 


GLU C 157 


14.490 


64.767 


32. 


246 


1.00 97.07 


7 


ATOM 


4518 


CA 


GLU C 157 


14.578 


65.011 


33. 


679 


1.00 98.94 


6 


ATOM 


4519 


CB 


GLU C 157 


14.487 


66.514 


33. 


942 


1.00100.27 


6 


ATOM 


4520 


CG 


GLU C 157 


15.282 


67.359 


32. 


950 


1.00102 .55 


6 


ATOM 


4521 


CD 


GLU C 157 


• 15.113 


68.852 


33.214 


1.00104.54 


6 


ATOM 


4522 


OE1 


GLU C 157 


13.943 


69.322 


33.312 


1.00105.88 


8 


ATOM 


4523 


OE2 


GLU C 157 


16.150 


69.555 


33.327 


1.00104.90 


8 


ATOM 


4524 


c' 


GLU C 157 


13.475 


64.290 


34. 


465 


1.00 99.30 


6 


ATOM 


4525 


O 


GLU C 157 


13.452 


64.337 


35. 


706 


1.00 99.77 


8 


ATOM 


4526 


N 


ASN C 158 


12.557 


63.639 


33. 


751 


1.00 99.15 


7 


ATOM 


4527 


CA 


ASN C 158 


11.457 


62.919 


34. 


404 


1.00 98.32 


6 


ATOM 


4528 


CB 


ASN C 158 


10.382 


62.541 


33. 


374 


1.00100.43 


6 


ATOM 


4529 


CG 


ASN C 158 


9.902 


63.736 


32. 


,555 


1.00101.38 


6 


ATOM 


4530 


OD1 


ASN C 158 


9.423 


64.738 


33. 


,112 


1.00101.96 


8 


ATOM 


4531 


ND2 


ASN C 158 


10.026 


63.636 


31. 


,224 


1.00101.40 


7 


ATOM 


4532 


C 


ASN C 158 


11.991 


61.638 


35, 


.064 


1.00 96.64 


6 


ATOM 


4533 


O 


ASN C 158 


12.380 


61.643 


36. 


.239 


1.00 96.33 


8 


ATOM 


4534 


N 


SER C 159 


11.992 


60.558 


34 


.283 


1.00 93.61 


7 


ATOM 


4535 


CA 


SER C 159 


12.466 


59.237 


34.690 


1.00 90.35 


6 


ATOM 


4536 


CB 


SER C 159 


13.541 


58.788 


33.690 


1.00 90.84 


6 


ATOM 


4537 


OG 


SER C 159 


14.367 


59.898 


33 


.320 


1.00 90.44 


8 


ATOM 


4538 


C 


SER C 159 


13.005 


59.167 


36 


.123 


1.00 88.15 


6 


ATOM 


4539 


O 


SER C 159 


13.942 


59.894 


36 


.481 


1,00 88.99 


8 


ATOM 


4540 


N 


ASP C 160 


12.414 


58.302 


36 


.945 


1.00 84.53 


7 


ATOM 


4541 


CA 


ASP C 160 


12.863 


58.152 


38 


.330 


1.00 80.47 


6 


ATOM 


4542 


CB 


ASP C 160 


12.232 


56.921 


38 


.985 


1.00 80.26 


6 


ATOM 


4543 


CG 


ASP C 160 


12.760 


56.683 


40 


.398 


1.00 82.12 


6 


ATOM 


4544 


ODl 


ASP C 160 


12.658 


55.536 


40 


.890 


1.00 82.78 


8 


ATOM 


4545 


OD2 


ASP C 160 


13.277 


57.644 


41 


.023 


1.00 82.17 


8 


ATOM 


4546 


C 


ASP C 160 


14.380 


57.978 


38 


.340 


1.00 77.39 


6 


ATOM 


4547 


O 


ASP C 160 


14.892 


57.009 


37 


.768 


1.00 75.75 


8 


ATOM 


4548 


N 


ASP C 161 


15.080 


58.910 


38 


.992 


1.00 73.66 


7 


ATOM 


4549 


CA 


ASP C 161 


16.529 


58.864 


39 


.072 


1.00 71.77 


6 


ATOM 


. 4550 


CB 


ASP C 161 


17.060 


59.934 


40 


.028 


1.00 71.93 


6 


ATOM 


4551 


CG 


ASP C 161 


16.943 


61.330 


39 


.461 


1.00 72.28 


6 


ATOM 


4552 


ODl 


ASP C 161 


17.115 


61.486 


38 


.230 


1.00 71.71 


8 


ATOM 


4553 


OD2 


ASP C 161 


16.695 


62.270 


40 


.246 


1.00 74.13 


8 


ATOM 


4554 


C 


ASP C 161 


17.118 


57.515 


39 


.479 


1.00 70.97 


6 


ATOM 


4555 


O 


ASP C 161 


18.296 


57.251 


39 


.222 


1.00 73.00 


8 


ATOM ' 


4556 


N 


SER C 162 


16.335 


56.649 


40 


.105 


1.00 68.54 


7 


ATOM 


4557 


CA 


SER C 162 


16.899 


55.366 


40 


.496 


1.00 66.65 


6 


ATOM 


4558 


CB 


SER C 162 


17.085 


55.308 


42 


.020 


1.00 66.09 


6 


ATOM 


4559 


OG 


SER C 162 


15.845 


55.352 


42 


.698 


1.00 66.84 


8 


ATOM 


4560 


C 


SER C 162 


16.064 


54.194 


40 


.019 


1.00 65.29 


6 


ATOM 


4561 


O 


SER C 162 


16.042 


53.147 


40 


.646 


1.00 65.08 


8 


ATOM 


4562 


N 


GLU C 163 


15.393 


54.357 


38 


.892 


1.00 64.69 


7 


ATOM 


4563 


CA 


GLU C 163 


14.571 


53.270 


38 


.398 


1.00 66.58 


6 


ATOM 


4564 


CB 


GLU C 163 


13.543 


53.784 


37 


.372 


1.00 68.86 
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14.029 53.942 35.951 1.00 70.35 6 

12.886 54.261 35.000 1,00 73.03 6 

12.345 55.390 35.065 1.00 75.39 8 

12.517 53.378 34.193 1.00 72.63 8 

15.416 52.145 37.808 1.00 65.46 6 

14.902 51.071 37.481 1.00 64.98 8 

16.718 52.392 37.675 1.00 65.54 7 

17.647 51.389 37.143 1.00 63.20 6 

18.353 51.894 35.884 1.00 63.41 6 

17.433 52.101 34.716 1.00 65.49 6 

17.299 53.355 34.119 1.00 66.61 6 

16.429. 53.555 33.046 1.00 68.54 6 

16.676 51.046 34.216 1.00 68.25 6 

15.797 51.230 33.144 1.00 70.31 6 

15.680 52.484 32.562 1.00 70,26 6 

14.832 52.655 31.482 1.00 74.21 8 

18.690 51.066 38.184 1.00 61.13 6 

19.480 50.147 38.003 1.00 60.53 8 

18.687 51.824 39.279 1.00 59.92 7 

19.657 51.622 40.347 1.00 58.58 6 

19.497 52.690 41.425 1.00 56.39 6 

20.717 52.856 42.288 1.00 55.60 6 

21.904 53.336 41.742 1.00 53.27 6 

20.692 52.497 43.629 1.00 54.20 6 

23.049 53.451 42.513 1.00 53.30 6 
21.835 52.607 44.414 1.00 54.70 6 
23.018 53.085 43.854 1.00 54.90 6 
19.528 50.250 40.974 1.00 58.64 6 
18.422 49.749 41.153 1.00 60.35 8 
20.655 49.631 41.298 1.00 58.50 7 
20.614 48.309 41.900 1.00 58.50 6 
22.013 47.703 41.996 1.00 58.83 6 
21.957 46.391 42.542 1.00 61.34 8 

20.050 48.473 43.286 1.00 58.49 6 
20.346 49.454 43.964 1.00 58.40 8 
19.249 47.503 43.706 1.00 57.37 7 
18.631 47.545 45.020 1.00 57.17 6 
17.317 46.766 44.994 1.00 59.70 6 
17.467 45.351 44.490 1.00 62.90 6 
16.136 44.696 44.164 1.00 67.19 6 
15.284 44.519 45.048 1.00 68.37 8 
15.940 44.334 42.884 1.00 66.77 7 
19.548 46.975 46.085 1.00 55.30 6 
19.373 47.235 47.271 1.00 53.36 8 
20.541 46.211 45.659 1.00 55.00 7 
21.45.5 45.601 46.609 1.00 55.48 6 
21.845 44.214 46.114 1.00 55.25 6 
20.630 43.413 45.714 1.00 56.93 6 
20.242 43.315 44.378 1.00 56.29 6 
19.087 42. "640 44.016 1.00 56.50 6 
19.825 42.809 46.680 1.00 57.12 6 
18.664 42.133 46.332 1.00 56.97 6 
18.300 42.054 44.997 1>00 58.82 6 
17.133 41.408 44.640 1.00 61.29 8 
22.692 46.431 46.919 1.00 55.31 6 
23.582 45.987 47.637 1.00 53.34 8 
22.733 47.646 46.384 1.00 56.64 7 
23.851 48.553 46.620 1.00 58.01 6 
23.731 49.786 45,720 1.00 58.39 6 
24.745 50.727 46.017 1.00 57.32 8 
23.860 49.001 48.076 1.00 60.12 6 
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22.803 
25.052 
25.174 
26.636 
26.999 
28.466 
29.418 
30.317 
30.384 
31.166 
24.668 
24.416 
24.498 
24.063 
25.131 
26.521 
27 .189 
27.145 
28.463 
28.412 
29.074 
22.732 
22.164 
22.228 
20.947 
19.806 
19.891 
' 18.753 
17.626 
18.976 
21.014 
21.815 
20.186 
20.182 
20.016 
19.918 
21.191 
21.053 
19.088 
17.912 
19.480 
18.510 
19.164 
19.967 
20.647 
19.054 
17.985 
16.781 
18.893 
18.485 
17.949 
17.419 
16.598 
* 17.812 
19.651 
20.829 
19.312 
20.308 
20.426 
21.382 
20.900 



49.148 
49.215 
49.662 
49.602 
48.350 
47.972 
49.012 
49.556 
49.165 
50.480 
51.102 
51.609 
51.759 
53.146 
54.016 
53.719 
52.568 
54.559 
52.257 
54.258 
53.102 
53.346. 
52.406 
54.569 
54.877 
54.769 
55.739 
55.551 
55.236 
55.734 
56.279 
57.116 
56.530 
57.827 
57.687 
59.071 
56.896 
56.663 
58.756 
58.391 
59.972 
60.943 
61.916 
61.213 
62.264 
60.303 
61.680 
61.734 
62.223 
62.951 
64.334 
65.160 
64.650 
66.338 
63.084 
63.031 
. 63.237 
63.391 
62.109 
62.345 
60.943 



48.699 

48.621 
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51.424 
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1.00 60.78 
1.00 59.44 
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00 68.28 
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1.00 56.45 
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ATOM 4719 CE LYS C 180 

ATOM 4720 NZ LYS C 180 

ATOM 4721 C LYS C 180 

ATOM 4722 O LYS C 180 

ATOM 4723 N ASN C 181 

ATOM 4724 CA ASN C 181 

ATOM 4725 CB ASN C 181 

ATOM 4726 CG ASN C 181 

ATOM 4727 OD1 ASN C 181 

ATOM 4728 ND2 ASN C 181 

ATOM 4729 C ASN C 181 

ATOM 4730 O ASN C 181 

ATOM 4731 N SER C 182 

ATOM 4732 CA SER C 182 

ATOM 4733 CB SER C 182 

ATOM 4734 OG SER C 182 

ATOM 4735 C SER C 182 

ATOM 4736 O SER C 182 

ATOM 4737 N VAL C 183 

ATOM 4738 CA VAL C 183 

ATOM 4739 CB VAL C 183 

ATOM 4740 CGI VAL C 183 

ATOM 4741 CG2 VAL C 183 

ATOM 4742 C VAL C 183 

ATOM 4743 O VAL C 183 

ATOM 4744 N THR C 184 

ATOM 4745 CA THR C 184 

ATOM 4746 CB THR C 184 

ATOM 4747 OG1 THR C 184 
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19.869 64.532 39.368 1.00 67.12 6 
18.715 64.589 38.956 1.00 67.64 8 
20.786 65.447 39.075 1.00 68.63 7 
20.482 66.573 38.200 1.00 69.48 6 
20.215 67.861 39.004 1.00 69.26 6 
21.310 68,. 109 39.894 1.00 68.92 8 
18.932 67.718 39.804 1.00 68.40 6 
21.640 66.813 37.245 1.00 70.44 6 
22.802 66.641 37.614 1.00 69.76 8 
21.312 67.207 36.018 1.00 71.94 7 
22.320 67.470 34.998 1.00 73.43 6 
22.150 66.500 33.831 1.00 76.20 6 
21.560 65.148 34.220 1.00 80.00 6 
21.896 64.046 33.196 1.00 83.21 6 
21,745 64.237 31.973 1.00 84.40 8 
22.346 62.889 33.694 1.00 82.90 7 
22.149 68.884 34.482 1.00 72.16 6 
21.070 69.244 34.044 1.00 72.49 8 
23.214 69.675 34.522 1.00 71.74 7 
23.166 71.054 34.048 1.00 71.38 6 
23.205 72.022 35.233 1.00 73.17 6 
22.291 71.610 36.380 1.00 78.41 6 
22.499 72.459 37.644 1.00 79.07 6 
21.814 71.821 38.864 1.00 80.35 6 
22.363 70.452 39.163 1.00 81.10 7 
24.384 71.301 33.176 1.00 70.13 6 
25.504 71.353 33.681 1.00 70.65 8 
24.180 71.466 31.876 1.00 68.21 7 
25.306 71.719 30.978 1.00 67.25 6 
24.833 71.667 29.519 1.00 67.12 6 
24.008 72.846 29.053 1.00 63.90 6 
24.908 73.920 28.488 1.00 63.70 6 
25.645 73.452 27.223 1.00 63.89 6 
24.768 73.295 26.013 1.00 62.80 7 
25.971 73.075 31.266 1.00 67.33 6 
25.552 73.803 32.160 1.00 66.68 8 
27.027 73.399 30.528 1.00 67.89 7 
27.698 74.674 30.702 1.00 67.78 6 
27.967 74.948 32.191 1.00 69.90 6 
28.580 73.770 32.916 1.00 70.82 6 
29.508 73.137 32.422 1.00 73.77 8 
28.071 73.481 34.109 1.00 70.00 7 

28.977 74.884 29.901 1.00 67.29 6 
29.937 74.153 30.049 1.00 68.57 8 

28.978 75.911 29.058 1.00 67.43 7 
30.134 76.248 28.233 1.00 66.28 6 
29.726 77.290 27.186 1.00 65.32 6 
30.731 77.477 26.214 1.00 68.09 8 
31.230 76.798 29.141 1.00 65.83 6 
30.941 77.272 30.231 1.00 66.92 8 
32.483 76.731 28.698 1.00 65.76 7 
33.613 77.211 29.498 1.00 65.65 6 
.33.872-76.264 30.696 1.00 63.98 6 
33.648 74.843 30.277 1.00 64.03 6 
35.292 76.442 31.209 1.00 61.75 6 
34.925 77.375 28.728 1.00 65.88 6 
35.305 76.515 27.944 1.00 65.40 8 
35.616 78.485 28.957 1.00 66.99 7 
36.892 78.722 28.293 1.00 68.57 6 
36.995 80.168 27.732 1.00 67.80 6 
35.981 80.369 26.737 1.00 66.40 8 
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38.370 80.406 27.094 1.00 66.48 6 

38.032 78.470 29.292 1.00 70.57 6 

37.920 78.802 30.482 1.00 70.99 8 

39.118 77.869 28.815 1.00 70.74 7 

40.236 77.572 29.683 1.00 71.98 6 

40.555 76.067 29.658 1.00 72,58 6 

39.351 75.195 29.937 1.00 72.59 6 

38.363 75.011 28.970 1.00 73.55 6 

37.224 74.258 29.236 1.00 72.54 6 

39.164 74.597 31.185 1.00 72.27 6 

38.017 73.839 31.461 1.00 71.72 6 

37.057 73.678 30.480 1.00 71.73 6 

35.920 72.951 30.732 1.00 71.36 8 

41.426 78.355 29.191 1.00 73.72 6 

41.625 78.492 27.983 1.00 74.41 8 

42.220 78.869 30.125 1.00 75.63 7 

43.405 79.653 29.772 1.00 77.64 6 

44.183 80.015 31.043 1.00 77.47 6 

44.398 78.870 31.858 1.00 78.93 8 

44.300 78.875 28.793 1.00 77.75 6 

44.926 79.457 27.893 1.00 76.44 8 

44.332 77.556 28.977 1.00 78.65 7 

45.116 76.648 28.135 1.00 79.38 6 

44.658 76.703 26.715 1.00 79.55 6 

45.443 76.687 25.771 1.00 79.09 8 

44.899 75.178 28,529 1.00 79.04 6 

43.205 74.462 28.241 1.00 80.14 16 

43.345 76.774 26.589 1.00 80.45 7 

42.727 76.642 25.305 1.00 79.63 6 

41.779 77.755 24.835 1.00 79.24 6 

40.785 78.065 25.504 1.00 81.46 8 

42.037 75.269 25.365 1.00 79.66 6 

42.984 73.940 26.264 1.00 75.61 16 

42.070 78.346 23.656 1.00 77.55 7 

43.222 77.849 22.879 1.00 76.91 6 

41.378 79.431 22.922 1.00 75.67 6 

41.921 79.283 21.502 1.00 75.82 6 

43.328 78.858 21.747 1.00 77.27 6 

39.835 79.468 22.927 1.00 73.97 6 

39.233 80.403 23.459 . 1.00 73.90 8 

39.201 78.465 22.317 1.00 71.85 7 

37.734 78.404 22.246 1.00 68.61 6 

37.305 77.497 21.099 1.00 70.45 6 

38.277 77.434 19.945 1.00 72.05 6 

38.082 78.554 18.969 1.00 73.59 6 

36.908 78.876 18.657 1.00 73.46 8 

39.106 79.100 18.504 1.00 77.52 8 

37.084 77.889 23.528 1.00 64.48 6 

37.762 77.596 24.501 1.00 64.47 8 

35.764 77.765 23.514 1.00 60.72 7 

35.035 77.281 24.679 1.00 60.54 6 

33.755 78.075 24.854 1.00 59.52 6 

•34.702 75.796 24.554 1.00 60.66 6 

34.423 75.306 23.458 1.00 60.77 8 

34.717 75.080 25.675 1.00 58.74 7 

34.409 73.653 25.659 1.00 57.24 6 

35.621 72.826 26.109 1.00 55.18 6 

36.737 72.825 25.097 1.00 55.96 6 

37.694 73.8.42 25.075 1.00 56.31 6 

38.680 73.887 24.092 1.00 56.74 6 

36.800 71.844 24.109 1.00 56.77 6 
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ATOM 4827 ODl ASP C 194 

ATOM 4828 OD2 ASP C 194 

ATOM 4829 C ASP C 194 

ATOM 4830 O ASP C 194 

ATOM 4831 N VAL C 195 

ATOM 4832 CA VAL C 195 

ATOM 4833 CB VAL C 195 

ATOM 4834 CGI VAL C 195 

ATOM 4835 CG2 VAL C 195 
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37.785 71.874 23.121 1.00 57.49 6 
38.720 72.898 23.119 1.00 58.13 6 
39.689 72.918 22.148 1.00 58.55 8 
33.204 73.295 26.508 1.00 57.26 6 
33.292 73.241 27.736 1.00 57.40 8 
32.084 73.042 25.836 1.00 57.28 7 
30.839 72.678 26.506 1.00 58.77 6 
29.681 72.597 25.495 1.00 59.02 6 
29.342 73.912 24.823 1.00 61.64 6 
28.118 73.827 23.945 1.00 63.79 6 
27.107 73.242 24.411 1.00 66.05 8 
28.171 74.355 22.805 1.00 63.82 8 
30.968 71.336 27.224 1.00 58.56 6 
31.749 70.471 26.811 1.00 58.35 8 
30.197 71.171 28.296 1.00 57.04 7 
30.215 69.940 29.059 1.00 57.30 6 
31.473 69.849 29.932 1.00 58.11 6 
31.430 70.777 31.141 1.00 58.91 6 
32.220 71.742 31.172 1.00 57.40 8 
30.614 70.538 32.060 1.00 58.46 8 
28.982 69.848 29.933 1.00 56.78 6 
28.411 70.864 30.320 1.00 56.90 8 
28.567 68.619 30.223 1.00 55.47 7 
27.404 68.373 31.061 1.00 55.52 6 
26.538 67.236 30.504 1.00 54.50 6 
25.469 66.841 31.511 1.00 52.44 6 
25.914 67.671 29.199 1.00 55.96 6 
27.874 67.973 32.444 1.00 56.12 6 
28.661 67.046 32.602 1.00 56.74 8 
27.388 68.675 33.451 1.00 56.91 7 
27.777 68.370 34.816 1.00 57.46 6 
28.051 69.654 35.581 1.00 58.09 6 
28.548 69.445 36.972 1.00 59.59 6 
28.730 70.758 37.700 1.00 62.33 6 
29.523 71.594 37.229 1.00 62.53 8 
28.074 70.958 38.746 1.00 65.85 8 
26.636 67.627 35.453 1.00 57.05 6 
25.517 68.120 35.487 1.00 59.06 8 
26.914 66.427 35.938 1.00 56.46 7 
25.889 " 65.612 36.566 1.00 55.58 6 
25.867 64.179 35.984 1.00 53.38 6 
24.777 63.360 36.649 1.00 50.20 6 
25.629 64.239 34.486 1.00 51.56 6 
26.199 65.546 38.041 1.00 56.50 6 
27.320 65.207 38.430 1.00 58.31 8 
25.214 65.885 38.866 1.00 57.35 7 
25.407 65,857 40.309 1.00 57.08 6 
24.867 67.137 40.943 1.00 56.30 6 
25.661 68.246 40.565 1.00 55.97 8 
24.717 64.643 40.886 1.00 57.36 6 
23.513 64.460 40.731 1.00 58.49 8 
25.494 63.791 41.531 1.00 57.83 7 
24.938 62.594 42.120 1.00 57.20 6 
25.824 61.383 41.834 1.00 56.00 6 
25.457 60.110 42.606 1.00 56.43 6 
24.073 59.627 42.211 1.00 53. .65 6 
26.487 59.050 42.325 1.00 55.64 6 

24.818 62.782 43.613 1.00 58.48 6 

25.819 62.737 44.337 1.00 59.73 8 
23.593 63.012 44.070 1.00 58.64 7 
23.355 63.168 45.489 1.00 58.44 6 
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200 22.285 64.232 

200 21.985 64.387 

200 22.905 64.419 

200 20.700 64.480 

200 22.898 61.801 

200 21.821 61.316 

201 23.739 61.181 
201 23.454 59.867 
201 24.169 58.807 
201 25.663 58.820 
201 26.312 57.763 
201 26.413 59.912 
201 27.689 57.794 
201 27.782 59.955 
201 28.425 58.889 
201 23.979 59.812 

201 24.640 60.739 

202 23.698 58.702 
202 24.140 58.511 
202 23.192 59.224 
202 21.844 58.529 
202 20.831 59.278 
202 19.555 58.567 
202 18.430 59.062 
202 18.420 60.270 
202 17.315 58.349 
202 24.116 57.023 

202 23.439 56.231 

203 24.860 56.651 
203 24.886 55.270 
203 25.931 55.088 
203 25.988 53.688 
203 26.955 53.617 
203 27.036 52.209 
203 27.621 51.250 
203 23.477 55.037 

203 22.878 55.942 

204 22.913 53.858 
204 21.584 53.606 
204 21.017 52.329 
204 20.591 52.487 
204 20.445 51.140 
204 19.975 51.298 
204 19.967 50.027 
204 21.742 53.460 

204 22.711 52.854 

205 20.811 54.022 
205 20.898 53.921 
205 19.797 53.078 
205 18.911 52.614 
205 19.811 52.879 

1 39.182 71.754 

1 40.239 71.385 

1 40.122 70.241 

1 41.397 72.176 

1 41.141 69.886 

1 42.418 71.835 

1 42.289 70.688 

1 37.071 70.999 

1 37.607 71.392 

1 37.010 72.284 



45.754 1.00 60.17 6 

47.239 1.00 62.50 6 

48.070 1.00 64.15 8 

47.583 1.00 61.21 7 

45.988 1.00 55:97 6 

45.627 1.00 54.75 8 

46.802 1.00 53.51 7 

47.330 1.00 53.54 6 

46.503 1.00 50.74 6 

46.677 1.00 48.36 6 

47.309 1.00 45.72 6 

46.252 1.00 46.85 6 
47.521 1.00 43.95 6 

46.457 1.00 45.93 6 
47.096 1.00 44.09 6 
48.749 1.00 55.36 6 
49.210 1.00 53.66 8 
49.425 1.00 57.35 7 
50.792 1.00 59.95 6 
51.744 1.00 62.36 6 
51.789 1.00 64.26 6 
52.610 1.00 67.26 6 
52.620 1.00 70.54 7 
53.129 1.00 69.46 6 
53.670 1.00 69.25 7 
53.095 1.00 69.56 7 
51.119 1.00 60.80 6 
50.445 1.00 60.45 8 
52.158 1.00 61.42 7 
52.603 1.00 60.82 6 
53.703 1.00 59.66 6 
54.258 1.00 61.19 6 
55.404 1.00 63.27 6 
55.947 1.00 66.98 6 
54.945 1.00 68.80 7 
53.157 1.00 60.81 6 
53.735 1.00 60.52 8 
52.944 1.00 60.36 7 
53.466 1.00 60.22 6 
52.837 1.00 58.64 6 
51.381 1.00 55.82 6 
50.692 1.00 51.95 6 

49.253 1.00 52.70 6 

48.458 1.00 51.76 7 
54.977 1.00 62.01 6 
55.440 1.00 63.36 8 
55.747 1.00 62.55 7 
57.202 1.00 62.30 6 
57.844 1.00 61.75 6 
57.093 1.00 60.18 8 
59.092 1.00 60.86 8 

1.648 1.00 71.47 6 

0.623 1.00 73.60 6 

-0.169 1.00 75.22 6 

0.493 1.00 73.83 6 

-1.081 1.00 75.00 6 

-0.410 1.00 73.07 6 

-1.199 1.00 74.63 6 

2.658 1.00 68.35 6 

3.688 1.00 69.33 8 

0.515 1.00 69.08 7 
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37.756 71.268 1.321 1.00 69.54 6 

35.915 70.338 2.678 1.00 67.84 7 

35.253 70.061 3.967 1.00 66.03 6 

33.949 69.266 3.772 1.00 66.47 6 

34.138 68.032 2.928 1.00 68.89 6 

35.029 67.218 3.287 1.00 68.16 8 

33.396 67.886 1.912 1.00 70.47 8 

36.181 69.310 4.933 1.00 64.63 6 

37.378 69.165 4.672 1.00 64.43 8 

35.639 68.837 6.049 1.00 62.26 7 

36.461 68.128 7.029 1.00 60.44 6 

35.748 68.078 8.388 1.00 60.90 6 

36.068 69.254 9.302 1.00 60.80 6 

35.185 69.243 10.532 1.00 65.18 6 

35.849 69.838 11.696 1.00 66.90 7 

36.028 71.141 11.888 1.00 66.63 6 

35.591 72.031 11.002 1.00 68.16 7 
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58.652 
59.657 
57.517 
57.373 
56.794 
55.514 
56.442 
"55.397 
56.845 
56.089 
56.911 
56.015 
57.573 
55.753 
56.607 
54.508 



68.079 
67.105 
68.570 
67.975 
68.283 
68.719 
67.908 
69.881 
68.594 
68.347 
69.409 
70.062 
71.189 
72.455 
73.597 
74.866 
76.024 
76.073 
77.136 
69.091 
68.303 
69.127 
69.831 
68.252 
68.450 
68.884 
68.738 
69.874 
67.870 
68.258 
67.036 
67.401 
66.579 
68.896 
68.337 
70.088 
70.767 
72.269 
70.371 
70.741 
69.603 
69.127 
67.599 
67.162 
66.868 
69.805 
69.806 
70.395 
71.049 
72.449 
72.393 
70.188 
69.746 
69.947 
69.130 
67.898 
66.914 
67.234 
69.947 
70.632 
69.850. 



8.399 
8.907 
8.875 
10.055 
10.123 
8.780 
7.806 
8.712 
11.274 
12.414 
11.026 
12.111 
11.585 
11.150 
10.942 
10.584 
10.463 
10.673 
10.142 
12.901 
12.325 
14.237 
15.020 
15.101 
16.483 
16.165 
15.095 
14.713 
15.510 
15.596 
15.494 
15.998 
14.052 
16.968 
17.961 
17.019 
18.287 
18.095 
18.923 
18.440 
20.006 
20.775 
21.016 
21.871 
19.692 
22.131 
22.852 
22.471 
23.754 
23.575 
22.966 
24.598 
24.126 
25.841 
26.780 
27 .241 
27.986 
26.043 
28.015 
28.570 
28.458 



1.00 87.43 
1.00 88.35 
1.00 87.68 
1.00 86.96 
1.00 91.11 
1.00 94.40 
1.00 95.87 
1.00 95.25 
1.00 85.02 
1.00 83.40 
1.00 82.95 
1.00 81.58 
1.00 83.67 
1.00 88.52 
1.00 92.65 
1.00 96.29 
1.00 98.03 
1.00 98.35 
1.00 98.69 
1.00 78.66 
1.00 79.51 
1.00 74.94 
1.00 73.76 
1.00 71.13 
1.00 71.49 
1.00 72.18 
1.00 67.33 
1.00 65.48 
1.00 63.87 
1.00 58.65 
1.00 57.16 
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00 55.97 
00 56.42 
00 55.56 
00 55.33 
00 54.28 
1.00 54.43 
1.00 52.62 
1.00 50.60 
1.00 48.93 
1.00 46.20 
1.00 47.60 
1.00 50.77 
1.00 51.59 
1.00 50.31 
1.00 52.56 
1.00 53.44 
1.00 56.80 
1.00 53.29 
1.00 55.20 
1.00 54.04 
1.00 56.25 
1.00 58.16 
1.00 56.17 
1.00 58.28 
1.00 57.41 
1.00 58.01 
1.00 59.19 
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54,062 
53.467 
52.892 
53.023 
51.906 
53.390 
52.468 
53.217 
54.192 
54.665 
53.513 
51.623 
52.136 
50.327 
49.422 
48.311 
48.802 
49.581 
50.100 
50.855 
48.838 
48.123 
49.144 
48.616 
49.441 
50.838 
51.823 
51.159 
53.109 
52.449 
53.424 
47.136 
46.626 
46.451 
45.030 
44.352 
42.850 
44.752 
44.398 
44.719 
43.677 
45.612 
45.363 
44.117 
43.392 
44.015 
42.068 
46.539 
47.380 
46.588 
47.612 
48.496 
•49.473 
49.220 
49.944 
46.795 
46.169 
46.802 
45.979 
45.489 
44.834 



70.594 

71.934 

72.641 

69.799 

69.572 

69.379 

68.629 

67.702 

66.726 

65.709 

66.034 

69.572 

70.542 

69.294 

70.089 

70.694 

71.519 

72.764 

73.559 

74.801 

69.065 

68.167 

69.173 

68.200 

68.199 

67.733 

68.617 

66.387 

68.170 

65.925 

66.818 

68.352 

69.449 

67.217 

67.156 

66.334 

66.369 

66.877 

68.324 

66.525 

66.798 

65.678 

65.030 

64.140 

63.990 

63.785 

64.085 

64.186 

63.752 

63.977 

63.163 

64.013 

63.140 

65.028 

66.110 

62.183 

62.565 

60.916 

59.874 

58.944 

59.723 



29.628 

29.175 

30.256 

30.433 

29.966 

31.638 

32.472 

33.426 

32.775 

33.818 

31.614 

33.291 

33.829 

33.366 

34.171 

33.320 

32.146 

32.581 

31.364 

31.747 

35.118 

34.685 

36.405 

37.346 

38.616 

38.393 

37.964 

38.530 

37.668 

38.235 

37.802 

37.642 

37.869 

37.600 

37.827 

36.731 

36.914 

35.360 

35.152 

39.164 

39.743 

39.661 

40.939 

40.834 

42.159 

43.200 

42.124 

41.426 

40.640 

42.740 

43.372 

44.246 

44.989 

43.359 

44.084 

44.190 

45.162 

43.777 

44.388 

43.287 

42.141 



00 59.47 
00 60.04 
,00 62.30 
,00 58.78 
,00 59.54 



1.00 56.81 
1.00 56.85 



00 54.23 
00 55.08 
.00 53.15 
,00 54.80 
,00 56.91 
.00 59.29 



1.00 56.10 
1.00 55.13 
1.00 58.38 
1.00 65.52 
1.00 .69.06 
1.00 72.37 
1.00 72.05 
1.00 53.80 
1.00 55.04 



,00 51.60 
.00 49.19 
.00 46.49 
.00 48.31 



1.00 46.94 
1.00 49.16 
1.00 48.04 
1.00 50.28 
1.00 48.13 
1.00 49.07 
1.00 49.57 
1.00 47.68 
1.00 44.72 
1.00 43.48 
1.00 41.45 
1.00 43.86 
1.00 46.64 
1.00 44.81 
1.00 46.51 
00 42.08 
00 42.31 
00 41.66 



.88 



00 43, 
00 43.62 
00 38.41 
00 43.81 
00 40.93 
1.00 43.93 
1.00 44.58 
00 42.93 
00 39.88 



00 43.19 
00 44.94 
00 47.97 
1.00 50.60 
1.00 50.68 
1.00 51.24 
1.00 51.95 
1.00 54.64 



6 

6 

8 

6 

8 

7 

6 

6 

6 

6 

6 

6 

8 

7 

6 

6 

6 

6 

6 

7 

6 

8 

7 

6 

6 

6 

6 

6 

6 

6 

6 

6 

8 

7 

6 

6 

6 

6 

6 

6 

8 

7 

6 

6 

6 

8 

7 

6 

8 

7 

6 

6 

6 

6 

6 

6 

8 

7 

6 

6 

6 



WO 01/58951 



PCT7EP01/01457 



ATOM 


5236 


CD1 


LEU D 


39 


ATOM 


5237 


CD2 


LEU D 


39 


ATOM 


5238 


C 


LEU D 


39 


ATOM 


5239 


0 


LEU D 


39 


ATOM 


5240 


N 


GLU D 


A A 

40 


ATOM 


5241 


CA 


GLU D 


40 


ATOM 


5242 


CB 


GLU D 


40 


ATOM 


5243 


CG 


GLU D 


40 


ATOM 


5244 


CD 


GLU D 


40 


ATOM 


5245 


OE1 


GLU D 


40 


ATOM 


5246 


OE2 


GLU D 


40 


ATOM 


5247 


C 


GLU D 


40 


ATOM 


5248 


0 


GLU D 


40 


ATOM 


5249 


N 


VAL D 


41 


ATOM 


5250 


CA 


VAL D 


41 


ATOM 


5251 


CB 


VAL D 


41 


ATOM 


5252 


CGI 


VAL D 


41 


ATOM 


5253 


CG2 


VAL D 


41 


ATOM 


5254 


C 


VAL D 


41 


ATOM 


5255 


O 


VAL D 


41 


ATOM 


5256 


N 


ASN D 


42 


ATOM 


5257 


CA 


ASN D 


42 


ATOM 


5258 


CB 


ASN D 


42 


ATOM 


5259 


CG 


ASN D 


42 


ATOM 


5260 


ODl 


ASN D 


42 


ATOM 


5261 


ND2 


ASN D 


42 


ATOM 


5262 


C 


ASN D 


42 


ATOM 


5263 


0 


ASN D 


42 


ATOM 


5264 


N 


GLU D 


43 


ATOM 


5265 


CA 


GLU D 


43 


ATOM 


5266 


CB 


GLU D 


43 


ATOM 


.5267 


CG 


GLU D 


43 


ATOM 


5268 


CD 


GLU D 


43 



ATOM 5269 OEl GLU D 43 

ATOM 5270 OE2 GLU D 43 
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ATOM 5272 O GLU D 43 
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ATOM 5288 N ASN D 46 
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44.356 58.777 41.068 1.00 53.53 6 
43.666 60.542 42.696 1.00 53.70 6 
46.520 59.041 45.529 1.00 53.82 6 
45.793 58.750 46.479 1.00 54.77 8 
47.771 58.620 45.437 1.00 54.21 7 
48.349 57.825 46.507 1.00 56.24 6 
48.278 56.339 46.204 1.00 58.19 6 
46.873 55.801 46.103 1.00 65.00 6 
46.844 54.291 45.938 1.00 68.23 6 - 
47.443 53.791 44.955 1.00 70.63 8 
46.226 53.611 46.789 1.00 68.96 8 
49.785 58.198 46.702 1.00 56.38 6 
50.541 58.355 45.746 1.00 59.25 8 
50.162 58.343 47.955 1.00 55.49 7 
51.517 58.695 48.273 1.00 54.71 6 
51.590 60.145 48.811 1.00 55.08 6 
52.954 60.431 49.361 1.00 55.65 6 
51.273 61.125 47.696 1.00 55.07 6 
52.003 57.713 49.309 1.00 54.17 6 
51.232 57.239 50.136 1.00 53.51 8 
53.280 57.381 49.233 1.00 54.90 7 
53.880 56.473 50.182 1.00 56.47 6 
53.944 55.056 49.612 1.00 55.97 6 
54.306 54.025 50.661 1.00 56.38 6 
55.272 54.191 51.408 1.00 55.15 8 
53.536 52.947 50.717 1.00 56.12 7 
55.278 57.022 50.420 1.00 58.56 6 
56.154 56.912 49.567 1.00 58.83 8 
55.474 57.639 51.579 1.00 59.69 7 
56.771 58.208 51.905 1.00 60.97 6 
56.640 59.192 53.065 1.00 63.50 6 
57.921 59.959 53.341 1.00 67.19 6 
57.725 61.121 54.303 1.00 68.84 6 
58.743 61.734 54.682 1.00 70.83 8 
56.568 61.427 54.670 1.00 67.80 8 
57.792 57.134 52.246 1.00 59.80 6 
58.993 57.356 52.138 1.00 59.85 8 
57.301 55.969 52.653 1.00 59.27 7 
58.164 54.858 53.006 1.00 59.05 6 
57.373 53.723 53.681 1.00 59.62 6 
58.300 52.527 53.945 1.00 59.50 6 
56.752 54.217 54.982 1.00 58.87 6 
57.768 54.484 56.073 1.00 60.45 6 
58.801 54.286 51.751 1.00 59.04 6 
60.001 54.029 51.723 1.00 60.06 8 
57.986 54.080 50.719 1.00 57.12 7 
58.461 53.513 49.464 1.00 54.05 6 
57.410 52.576 48.857 1.00 52.43 6 
56.204 53.304 48.628 1.00 49.39 8 
57.128 51.426 49.788 1.00 49.70 6 
58.833 54.551 48.417 1.00 53.32 6 
59.427 54.215 47.397 1.00 56.15 8 
58.493 55.809 48.666 1.00 51.82 7 
58.796 56.874 47.723 1.00 51.33 6 
60.305 57.022 47.567 1.00 51.66 6 
60.874 58.117 48.434 1.00 52.73 6 
62.057 58:105 48.765 1.00 52.55 8 
60.041 59.078 48.793 1.00 50.90 7 
58.156 56.618 46.360 1.00 50.97 6 
58.820 56.668 45.325 1.00 53.17 8 
56.858 56.348 46.371 1.00 48.99 7 
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56.118 56.091 45.155 1.00 48.91 6 

55.717 54.627 45.105 1.00 46.66 6 

56.888 53.678 45.006 1.00 47.02 6 

56.458 52.227 45.018 1.00 49.64 6 

55.302 51.956 44.644 1.00 48.30 8 

57.276 51.357 45.391 1.00 51.88 8 

54.888 56.992 45.111 1.00 50.12 6 

54.222 57.202 46.125 1.00 50.30 8 

54.591 57.531 43.936 1.00 50.20 7 

53.455 58.423 43.793 1.00 52.05 6 

53.925 59.861 43.502 1.00 53.36 6 

52.727 60.778 43.352 1.00 56.39 6 

54.801 60.352 44.620 1.00 54.18 6 

52.522 57.995 42.673 1.00 51.50 6 

52.962 57.552 41.617 1.00 53.49 8 

51.231 58.137 42.910 1.00 49.46 7 

50.241 57.777 41.920 1.00 49.81 6 

49.241 56.813 42.535 1.00 52.06 6 

48.447 56.086 41.508 1.00 53.16 6 

48.086 56.719 40.499 1.00 53.36 8 

48.176 54.887 41.717 1.00 57.15 8 

49.583 59.105 41.580 1.00 49.01 6 

48.818 59.646 42.373 1.00 48.57 8 
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72.073 55.601 15.478 1.00 69.74 6 

72.311 56.794 15.711 1.00 71.52 8 

72.005 54.676 16.437 1.00 70.16 7 

68.850 56.189 11.932 1.00 60.94 6 

68.599 57.336 11.548 1.00 60.74 • 8 

67.910 55.318 12.281 1.00 57.54 7 

66.495 55.652 12.254 1.00 54.00 6 

65.857 55.296 10.901 1.00 52.88 6 

66.391 56.201 9.814 1.00 52.07 6 

66.151 53.846 10.564 1.00 53.11 6 

65.816 54.844 13.349 1.00 51.46 6 

66.355 53.838 13.804 1.00 51.44 8 

64.649 55.299 13.789 1.00 48.05 7 

63.893 54.592 14.812 1.00 45.32 6. 

63.222 55.583 15.761 1.00 44.86 6 

64.122 56.022 16.763 1.00 42.99 8 

62.846 53.727 14.123 1.00 45.51 6 

61.959 54.228 13.431 1.00 45.94 8 

62.953 52.420 14.325 1.00.44.89 7 

62.052 51.463 13.706 1.00 43.76 6 

62.857 50.400 12.943 1.00 44.31 6 

61.930 49.411 12.296 1.00 45.29 6 

63.734 51.060 11.907 1.00 45.23 6 

61.169 50.751 14.718 1.00 43.81 6 

61.641 50.331 15.772 1.00 46.08 8 

59.868 50.612 14.418 1.00 41.91 7 

59.085 51.238 13.344 1.00 41.05 6 

58.987 49.923 15.360 1.00 39.46 6 

57.619 50.076 14.719 1.00 39.68 6 
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31.694 22.169 23.679 1.00109.78 6 
32.042 22.822 25.000 1.00106.01 6 
33.258 20.667 25.468 1.00 99.68 6 
34.107 19.901 26.462 1.00 97.90 6 
33.049 21.220 28.203 1.00 99.77 6 
32.154 21.953 27.266 1.00101.85 6 
34.051 19.067 28.833 1.00 98.50 6 
35.030 19.802 29.773 1.00 97.76 6 
34.441 19.860 31.064 1.00 96.20 8 
32.880 22.043 25.968 1.00103.11 7 
31.207 23.336 22.418 1.00113.66 16 
31.826 22.878 21.182 1.00113.59 8 
31.477 24.685 22.941 1.00111.64 8 

29.701 23.322 22.307 1.00111.51 8 
33.333 19.839 27.737 1.00 98.16 7 
19.833 49.708 24.248 1.00108.88 6 
20.653 49.684 25.518 1.00104.63 6 

19.090 47.814 26.172 1.00100.27 6 
18.728 46.788 27.241 1.00 98.30 6 

19.702 48.326 28.859 1.00100.12 6 
20.022 49.380 27.845 1.00101.78 6 
18.080 46.646 29.663 1.00 97.02 6 
19.186 46.065 30.550 1.00 96.36 6 
19.161 46.714 31.805 1.00 95.77 8 
20.281 48.676 26.560 1.00102.78 7 
20.640 50.530 22.892 1.00112.59 16 
20.348 49.765 21.697 1.00113.24 8 
22.024 50.791 23.309 1.00111.25 8 
20.059 51.924 22.770 1.00111.32 8 
18.478 47.544 28.507 1.00 98.39 7 
42.028 70.369 23.900 1.00109.55 6 

42.091 69.312 25.003 1.00104.54 6 
40.178 70.594 25.988 1.00 97.17 6 
39.192 70.688 27.124 1.00 96.78 6 
40.844 69.540 28.502 1.00 98.62 6 
41.854 69.476 27.408 1.00100.28 6 

39.103 70.891 29.639 1.00 97.37 6 
38.600 69.616 30.336 1.00 97.34 6 
39.310 69.466 31.552 1.00 95.42 8 

41.104 69.418 26.123 1.00100.25 7 
42.851 69.897 22.390 1.00113.40 16 
41.999 70.352 21.301 1.00112.83 8 
43.216 68.464 22.535 1.00111.97 8 
44.208 70.580 22.342 1.00111.22 8 
39.982 70.756 28.394 1.00 97.91 7 
67.843 54.529 23.109 1.00109.23 6 
67.696 54.053 24.549 1.00105.70 6 
67.448 56.488 25.071 1.00101.29 6 
67.199 57.526 26.150 1.00 99.73 6 
68.223 55.960 27.701 1.00100.32 6 
68.526 54.948 26.646 1.00101.68 6 
68.126 58.362 28.296 1.00 99.92 6 
67.284 58.007 29.528 1.00100.01 6 
68.179 57.751 30.594 1.00100.41 8 
67.472 55.082 25.609 1.00103.08 7 
67.556 53.252 21.895 1.00112.53 16 
66.829 53.860 20.813 1.00112.71 8 
67.011 52.080 22.599 1.00111.24 8 
68.908 52.765 21.423 1.00111.51 8 
68.258 57.354 27.169 1.00 99.66 7 
62.836 24.327 23.511 1.00108.64 6 
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Atom 

Type Residue # X Y Z OCC B 



Table 1: Structural coordinates of AChBP 

"Atom type" refers to the element whoose coordinate are measured. The first letter in 
the column defines the element. 

"Residue" refers to the amino acid in the AChBP protein sequence, using the 
standard three letter abbreviations known in the art. 
"#" refers to the residue number. 

"X, Y, T crystallographically define the atomic position, in three-dimensional space, 
of the element measured. 
"OCC" is the occupancy volume. 

"B n is a thermal factor that measures movement of the atom around its atomic center. 
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Claims 



1 . A water-soluble protein derived from a mollusc being capable of binding a 
ligand of a ligand-gated receptor. 

5 

2. The protein of claim 1, wherein the ligand is acetylcholine, gamma-amino- 
butyric acid (GABA), glycine or serotonin. 

3. The protein of claim 2, wherein said protein is a acetylcholine-binding protein 
10 (AChBP). 

4. The protein of any one of claim 1 to 3 which is capable of forming multimers. 

5. The protein of any one of claims 1 to 4 which is derived form a Pulmonata 
1 5 species, preferably from a Basommatophora species. 

6. The protein of any one of claims 1 to 5 comprising an amino acid sequence 
selected from the group consisting of: 

(a) an amino acid sequence as depicted in any one of SEQ ID Nos. 2, 4, 
20 6 or 8 or a functional equivalent thereof, or a fragment of at least 5 

continuous amino acids thereof; 

(b) an amino acid sequence having at least 30% amino acid identity to the 
amino acid sequence of any one of SEQ ID Nos. 2, 4, 6 or 8. 

25 7. A water-soluble ligand binding protein capable of binding a ligand of a ligand- 
gated receptor and comprising at least 5 continuous amino acids of the 
aminoacid sequence depicted in any one of SEQ ID Nos. 2, 4, 6 or 8 and/or 
said protein is detectable by a monoclonal or polyclonal antibody which 
recognises, preferably with a binding affinity of at least 10" 7 M, a protein of any 

30 one of claims 1 to 6. 

8. A water-soluble protein being capable of binding a ligand of a ligand-gated 
receptor comprising 

(a) at least the amino acids of the water-soluble protein of any one of 
35 claims 1 to 6 determining solubility of said protein, in the same or 

corresponding positions as in said protein; and 

(b) at least 4 amino acids determining binding to said ligand. 
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9. The protein of claim 7 or 8 which is capable of forming multimers. 

1 0. The protein of any one of claims 7 to 9 comprising 200-240 amino acids. 

5 

1 1 . The protein of any one of claims 7 to 10, wherein the ligand is acetylcholine, 
nicotine, lophotoxin, d-tubocurarine, carbamylcholine, galanthamine or 
epibatidine. 



10 12. The protein of any one of claims 1 to 1 1 , wherein said ligand-gated receptor 
is derived from an arthropod (preferably insect), a plant (preferably a higher 
plant, most preferably a seed plant) or a chordate (preferably a mammalian, 
most preferably human). 

15 13. The protein of any one of claims 7 to 1 2, wherein said ligand-gated receptor is 
a nicotinic acetylcholine receptor. 

The protein of any one of claims 7 to 13, wherein said amino acids 
determining solubility are in the same positions as in the AChBP having the 
amino acid sequence as depicted in any one of SEQ ID Nos. 2, 4, 6 or 8; 
preferably in which said solubility-determining amino acids comprise solvent 
accessible regions in the crystal structure according to Figure 10. 

The protein of any one of claims 7 to 14 comprising an amino acid sequence 
having at least 40% amino acid identity to the amino acid sequence 20-223 of 
any one of SEQ ID Nos. 2, 4, 6 or 8, in which the ligand binding amino acids 
have been replaced with the corresponding amino acids of a ligand-gated 
receptor. 

30 16. The protein of any one of claims 7 to 15, in which said solubility-determining 
amino acids (a) comprise hydrophilic amino acids (Asp, Glu, Arg, Lys) from 
the sequences 20-44, 73-81,-86-92, 112-120, 135-152, 166-189, 196-20, 
209-21 3, and/or 21 9-227 of SEQ ID No. 2. 



14. 

20 



15. 

25 



35 17. The protein of claim 16, in which said solubility determining amino acids (a) 
comprise amino acids Asp(36), Asp(68), Glu(115), Arg(137), Asp(143), 
Asp(148), GIu(150), Arg(167), Arg(189). Glu(215) of SEQ ID No.2, wherein 
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Asp may be exchanged for Glu and vice versa and Lys may be exchanged for 
Arg and vice versa. 

18. The protein of any one of claims 7 to 17 which further comprises the amino 
5 acids Cys(142), Thr(149), Ala(153), Thr(154), Cys(155), Arg(156), lle(157) 

and/or Lys(1 58) of SEQ ID No. 2. 

19. The protein of any one of claims 7 to 17 which comprises the amino acids (b) 
Pro(39), Trp(77), Trp(101), Pro(103), Asp(194), and/or Ser(161) of SEQ ID 

10 No. 2. 

20. The protein of any one of claims 7 to 19 in which the amino acid sequences 
165-169 and/or 200-203 of SEQ ID No. 2 have been exchanged with the 
corresponding sequence of the ligand-gated receptor. 

15 

21 . The protein of any one of claims 7 to 20 which is capable of binding a ligand 
of an acetylcholine receptor, in which at least one of the amino acid 
sequences Trp(101) - Tyr(T08), Trp(162) - His(164) and Tyr(204) - Tyr(211) 
of SEQ ID No. 2 have been exchanged with the corresponding sequence of 

20 the acetylcholine receptor. 

22. A method for the production of a water-soluble ligand-gated receptor or a 
corresponding ligand-binding domain or for improving the water solubility and 
accessibility to crystallization of such a receptor or domain, said method 

25 comprising altering the amino acid sequence of the extracellular domain of a 

ligand-gated receptor by way of substituting, adding, deleting or modifying at 
least one amino acid at a position corresponding to an amino acid 
determining or contributing to the water-solubility of the protein of any one of 
claims 1 to 21 . 



30 



23. The method of claim 22, wherein the ligand-gated receptor is defined as in 
any one of claims 1 to 21 . 



24. 

35 



The method of claim 22 or 23, wherein at least one amino acid is altered to 
the corresponding amino acid of the amino acid sequence depicted in any 
one of SEQ ID Nos. 2, 4, 6 or 8, or to a an equivalent amino acid, preferably 
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in which said solubility-determining amino acids comprise solvent accessible 
regions in the crystal structure according to Figure 10. 

25. The method of any one of claims 22 to 24, wherein loop Cys1 23-Cys1 36 of 
5 SEQ ID No. 2 is inserted into the corresponding region of the ligand binding 

domain of the ligand-gated receptor. 

26. The method of any one of claims 22 to 25 further comprising 

(a) culturing a host cell transfected with and capable of expressing a 

1 0 polynucleotide comprising a nucleotide sequence encoding the altered 

amino acid sequence; and optionally 

(b) recovering said water-soluble ligand-gated receptor or corresponding 
Iigand-binding domain from the culture. 

15 27. A water-soluble ligand-gated receptor or iigand-binding domain obtainable by 
the method of any one of claims 22 to 26. 

28. The protein of any one of claims 1 to 21 or 27 further comprising a spacer 
sequence allowing coupling with a carrier body. 

20 

29. A fusion protein comprising the water-soluble Iigand-binding protein of any 
one of claims 1 to 21, 27 or 28, or a binding fragment thereof and a fragment 
of a ligand-gated receptor 

25 30. A dimer or pentamer consisting of at least one monomer comprising a protein 
of any one of claims 1 to 21 or 27 to 29. 

31 . A ligand-gated ion channel comprising a protein of any one of claims 1 to 21 
or 27 to 29 or the dimer or pentamer of claim 30. 

30 

32. One or more polynucleotides encoding the protein of any one of claims 1 to 
21 or 27 to 29, the dimer or pentamer of claim 30 or the ligand-gated ion 
channel of claim 31 . 

35 33. The polynucleotide(s) of claim 32 which comprise(s) 
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(a) a nucleotide sequence having at least 1 5 continuous nucleotides of 
the nucleotide sequence depicted in any one of SEQ ID Nos. 1 , 3, 5 or 
7 or a degenerated sequence thereof; or 

(b) a nucleotide sequence capable of hybridizing to a nucleotide 
5 sequence of (a) under stringent hybridisation conditions. 

34. The polynucleotide(s) of claim 32 or 33 which is(are) operatively linked to 
heterologous expression control sequences allowing expression inprokaryotic 
or eukaryotic cells. . 

10 

35. One or more vector(s) containing the polynucleotide(s) ,of any one of claims 
32 to 34. 

36. A host cell genetically engineered with the polynucleotide(s) of any one claims 
15 32 to 34 or with the vector(s) of claim 35. 

37. An antigen comprising an epitope of at least 5 continuous amino acids of the 
amino acid sequence depicted in any one of SEQ ID Nos. 2, 4, 6 or 8 and/or 
said epitope is detectable by a monoclonal or polyclonal antibody which 

20 recognises, preferably with a binding affinity of at least 10" 7 M, a protein of any 

one of claims 1 to 6. 

38. An antibody specifically recognizing the protein of any one of claims 1 to 21 or 
27 to 29, the dimer or pentamer of claim 30, the ligand-gated ion channel of 

25 claim 31 or the antigen of claim 37. 

39. An oligonucleotide probe comprising a nucleotide sequence having at least 15 
continuous nucleotides of a polynucleotide of any one claims 32 to 34 or 
encoding the antigen of claim 37. 

30 

40. A composition comprising the protein of any one of claims 1 to 21 or 27 to 29, 
the dimer or pentamer of claim 30, the ligand-gated ion channel of claim 31, 
the polynucleotide(s) of any one claims 32 to 34, the vector(s) of claim 35, the 
host cell of claim 36, the antigen of claim 37, the antibody of claim 38, or an 

35 oligonucleotide probe of claim 39; and optionally suitable means for detection 

or performing a ligand-receptor binding assay. 
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41 . A method for identifying an agonist/activator or antagonist/inhibitor of a ligand- 
gated receptor comprising the steps of: 

(a) contacting the water-soluble ligand-binding protein of any one of 
claims 1 to 21 or 27 to 29, the dimer or pentamer of claim 30, the 
5 ligand-gated ion channel of claim 31 or a cell expressing said 

protein in the presence of components capable of providing a 
detectable signal in response to ligand binding with a compound to be 
screened under conditions that permit binding of said compound 
to the ligand-binding protein; and 
1 0 (b) detecting the presence or absence of a signal generated from the 

binding activity of the ligand-binding protein, wherein the 
presence/increase and absence/decrease of the signal is indicative for 
an agonist/activator and antagonist/inhibitor, respectively, of a ligand- 
gated receptor. 

15 

42. A crystal of a protein of any one of claims 1 to 21 or 27 to 29, the dimer or 
pentamer of claim 30 or the ligand-gated ion channel of claim 31 . 

43. A crystal of a protein-ligand complex comprising a protein of any one of 
20 claims 1 to 21 or 27 to 29, the dimer or pentamer of claim 30 or the ligand- 
gated ion channel of claim 31; and a ligand. 

44. The crystal of claim 43, wherein the ligand comprises an N-alkylated 
hydroxyalkyl and/or a quaternary ammonium ion. 



25 



30 



35 



45. The crystal of claim 43, wherein the ligand comprises 4-(2-Hydroxyethyl)-1- 
piperazineethanesulfonic acid (HEPES), B-bippinatin, lophotoxin, d- 
tubocurarine, carbamylcholine, galanthamine, epibatidine or alpha- 
bungarotoxin. 

46. The crystal of any one of claims 42 to 45, wherein the crystal effectively 
diffracts X-rays for the determination of the atomic coordinates of the protein 
or protein-ligand complex to a resolution of greater than 5.0, preferably 
greater than 4.0 Angstroms. 

47. The crystal of any one claims 42 to 46, wherein the protein has an amino acid 
sequence of amino acids 20 to 223 of SEQ ID No. 2, or an amino acid 
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sequence that differs from amino acid 20 to 223 of SEQ ID No. 2 by only 
having conservative substitutions. 

48. The crystal of claim 47, wherein the ligand is HEPES. 

5 

49. The crystal of claim 46 having (1) a space group of P2,2fo and a unit cell of 
dimensions of a=120.6A, b=137.0A and c=161.5A; (2) a space group of 
P4 2 2i2 and a unit cell of dimensions of a=b=141.6A and c=120.8A or (3) a 
space group of Pl\ and a unit cell of dimensions of a=121.lA, b=162.lA, 

10 c=139,4A, (3=90.1°. 

50. The crystal of any one of claims 42 to 49, wherein the protein has secondary 
structural elements that include .alpha.-helix and antiparallel .beta.-sheets as 
shown in Figure 7, 10, 11 and/or 12. 

15 

51 . The crystal of any one claims 42 to 50 having a three-dimensional structure 
as defined by atomic coordinates shown in Table 1. 

52. The crystal of any one of claims 42 to 51 having a binding cavity as shown in 
20 Figure 6, 8, 9 and/or 13. 

53. A method of using the crystal of any one of claims 42 to 52 in a drug 
screening assay comprising: 

(a) selecting a potential ligand by performing structure assisted drug 

25 design with the three-dimensional structure determined for the crystal, 

wherein said selecting is performed in conjunction with computer 
modeling; optionally 

(b) contacting the potential ligand with the ligand binding domain of 
the ligand-gated receptor in an in vitro or in vivo assay; and 

30 (c) detecting the binding of the potential ligand for the ligand binding 

domain. 

54. The method of claim 53, wherein the ligand-gated receptor is a nicotinic 
acetylcholine receptor. 



35 



55. The method of claim 53 or 54 further comprising: 
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(d) forming a supplemental crystal of a protein-ligand complex by co- 
crystallization or soaking the crystal of the water-soluble ligand-binding 
protein of any one of claims 1 to 21 or 27 to 29, the dimer or pentamer 
of claim 30 or the ligand-gated ion channel of claim 31 , with a potential 

5 drug, wherein the crystal effectively diffracts X-rays for the 

determination of the atomic coordinates of the protein-ligand complex 
to a resolution of greater than 5.0 f preferably greater than 4.0 
Angstroms, more preferably greater than 3; 

(e) determining the three-dimensional structure of the supplemental 
10 crystal; 

(f) selecting a candidate drug by performing a structure assisted drug 
design with the three-dimensional structure determined for the 
supplemental crystal, wherein said selecting is performed in 
conjunction with computer modeling; optionally 

1 5 (g) contacting the candidate drug with a cell that expresses the ligand- 

gated receptor; and 
(h) detecting a cell response; wherein a candidate drug is identified as 
a drug when the cell response is altered compared to a cell that 
has not been contacted with the candidate compound. 

20 

56. The method of any one of claims 53 to 55 further comprising an initial step 
that precedes step (a) wherein said initial step consists of determining the 
three-dimensional structure of a crystal comprising a protein-ligand complex 
formed between the water-soluble ligand-binding protein of any one of claims 

25 1 to 21 or 27 to 29, the dimer or pentamer of claim 30 or the ligand-gated ion 

channel of claim 31 , and the ligand of the ligand-gated receptor, wherein the 
crystal effectively diffracts X-rays for the determination of the atomic 
coordinates of the protein-ligand complex to a resolution of greater .than 5.0, 
preferably greater than 4.0 Angstroms. 

30 

57. A method of growing a crystal of a protein-ligand complex comprising: 
(a) contacting the water-soluble ligand-binding protein of any one of 

claims 1 to 21 or 27 to 29, the dimer or pentamer of claim 30 or the 
ligand-gated ion channel of claim 31 with a ligand of a ligand- 
35 gated receptor, wherein the water-soluble ligand-binding protein 

forms a protein-ligand complex with the ligand; and 
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(b) growing the crystal of the protein-ligand complex; wherein the 
crystal effectively diffracts X-rays for the determination of the 
atomic coordinates of the protein-ligand complex to a resolution 
of greater than 5.0, preferably greater than 4.0 Angstroms. 

5 

58. A drug screening assay comprising soaking the crystal of any one of claims 
42 to 52 in a solution of compounds to be screened and detecting the binding 
of the compound to the ligand-binding protein. 

10 59. The method of claim 57 or 58, wherein said ligand comprises an alkylated 
nitrogen and/or quaternary ammonium ion. 

60. A method of increasing or decreasing the affinity of a drug to a ligand-gated 
receptor, comprising 

1 5 (a) performing structure assisted drug design with the three-dimensional 

structure determined for the crystal of any one of claims 42 to 52, 
wherein said drug design is performed in conjunction with computer 
modeling; and 

(b) modifying said drug to alter or eliminate a portion thereof • 

20 suspected of interacting with a binding site of the binding cavity or 

with a non-specific binding site of the protein in the crystal. 

61 . The method of claim 60, wherein step (a) further comprises the steps of a 
method of any one of claims 53 to 59. 

25 

62. The method of claim 60 or 61 , further comprising after step (b), the additional 
step of : 

(c) repeating the method used to perform structure assisted drug design 
according to step (a) using the modified drug according to step (b). 

30 

63. A method of drug design comprising the step of using the structural 
coordinates of a waterrsoluble ligand-binding protein crystal comprising the 
coordinates of Table 1, to computationally evaluate a chemical entity for 
associating with the ligand-binding site or a non-specific binding site of a 

35 ligand-binding protein. 
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64. The method of any one of claims 53 to 63, wherein the identified drug 
prevents or promotes correct assembly of a ligand-gated ion channel. 

65. The method of any one of claims 53 to 63, wherein the identified drug binds to 
5 a non-specific binding site of a ligand-gated ion channel. 

66. The method of any one of claims 53 to 65 further comprising synthesizing the 
drug in a therapeutically effective amount. 

10 67. A drug produced by the method of claim 66 or a pro-drug thereof. 

68. The drug of claim 67 which interacts with a ligand-gated receptor comprising 
a pentamer of claim 30 with monomers A to E, wherein the drug binds to one 
or more primary contact regions of a monomer (residues from A contacting B) 

1 5 defined by amino acid residues 15 to 21 . 44 to 47, 85 to 87, 91 to 94, 122 to 

124, 143 to 146, 149, 185 to 187 of the mature protein of SEQ ID No. 2 and/or 
to one or more of the complementary contact regions of the other monomer 
(from B contacting A, (identical to residues on A contacting E) defined by 
amino acid residues 3 to 4, 7 to 8, 1 1 , 37 to 39, 53, 75 to 77, 96 to 104, 1 14 to 

20 118 and 163-170 of the mature protein of SEQ ID No. 2; or to the contact 

regions as identified in Figure 14; or to the corresponding contact regions of 
the monomers of a ligand-gated ion channel. 

69. The drug of claim 68, wherein the ligand-gated ion channel is the nicotinic 
25 acetylcholine receptor and the order of the monomers is ayapS. 

70. A computer readable medium comprising a nucleotide sequence of the 
polynucleotide(s) of any one of claims 32 to 34, an amino acid sequence of a 
protein of any one of claims 1 to 21 or 27 to 29, the dimer or pentamer of 

30 claim 30 or the ligand-gated ion channel of claim 31, or the structural 

coordinates of a crystal of any one of claims 40 to 50. 

71 . A device comprising the computer readable medium of claim 70. 

35 72. Use of the computer readable medium of claim 70 or the device of claim 71 
for modeling an antagonist/inhibitor or agonist/activator of a ligand-gated 
receptor. 
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73. Use of the crystal of any one of claims 42 to 52 or its structural coordinates as 
a template for modeling the 3D structure of a ligand-gated ion channel. 

5 74. Use of the polynucleotide(s) of any one of claims 32 to 34, the protein of any 
one of claims 1 to 21 or 27 to 29, the dimer or pentamer of claim 30, the 
ligand-gated ion channel of claim 31 , the vector(s) of claim 35, the host cell of 
claim 36, the antigen of claim 37, the antibody of claim 38, an oligonucleotide 
probe of claim 39, the crystal of any one of claims 42 to 52 or a method of any 
10 one of claims 53 to 66 for screening or profiling putative ligands of ligand- 

gated receptors. 

75. Use of an antagonist/inhibitor or agonist/activator identified according to a 
method of any one of claims 53 to 66 for the preparation of a pharmaceutical 

15 composition for the treatment of a ligand-gated ion channel mediated or 

related disorder. 

76. The use of claim 75, wherein the antagonist/inhibitor is or is derived from the 
protein of any one of claims 1 to 21 or 27 to 29, an antigen of claim 37, an 

20 antibody of claim 38 or from a toxin of the ligand-gated ion channel. 

77. The use of claim 75, wherein the agonist/activator is or is derived from a the 
protein of any one of claims 1 to 21 or 27 to 29, an antigen of claim 37, an 
antibody of claim 38 or from epibatidine, acetylcholine, choline, nicotine, 

25 carbachol, serotonin or GABA. 

78. The use of any one claims 75 to 77, wherein the ligand-gated ion channel is 
the nicotinic acetylcholine receptor and said mediated or related disorder is 
Tourette's syndrome, Alzheimer's disease, addiction to nicotine or 

30 schizophrenia. 

79. Use of ligand of a ligand-gated receptor for identifying and isolating a water- 
soluble ligand-binding protein from a mollusc. 



35 80. 



The use of claim 79, wherein said ligand is a-bungarotoxin. 
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L-AChBP_Tl. 
L-AChBP_T2 , 
B-AChBPJTl. 
B-AChBP T2. 



L-AChBP_Tl. 
L-AChBP T2. 
B-AChBPJTl . 
B-AChBP T2. 



— MRRNIFCLACLWIVQACLSLDRADILYNIRQTSRPDVIPTQRDRPVAVSVSLKFINIL 
— MRRNIFCLACLWIVQGCLSLDRADlLYNIRQTSRPDVIPTQRDRPVAVSVSIiKFINIL 
MAELRRIILLLCTIAFHVSHGQIRWTLLNQITGES — DVIPLSNNTPLNVSLNFKLMNIV 
MAELRGIILLLCTIAFHVSHGQIRWTLLNQITGES— DVIPLSNNTPLNVSLNFKLMNIL 



EVNEITNEVDWFWQQTTWSDRTLA-WNSSHSPDQVSVPISSLWVPDLAAYNAISKPEVL 
EVNEITNEVDWFWQQTTWSDRTLA-WNSSHSPDQVSVPISSLWVPDLAAYNAISKPEVL 
EADTEKDQVEWLWTQASWKVPYYSSLLSSSSLDQVSLPVSKMWTPDLSFYNAIAAPELL 
EADTEKDQVEWLWTQASWKVPYYSSLLSSSSLDQVSLPASKMWTPDLSFYNAIAAPELL 
*.: .::*:**:* : ** * ****.♦ * .* ***. ***** **.* 



L-AChBP_Tl. 
L-AChBP_T2 . 
B-AChBP_Tl. 
B-AChBP T2. 



L-AChBP_Tl . 
L-AChBP_T2 . 
B-AChBPJTl. 
B-AChBP T2. 



TPQLARWSDGEVL YMPS I RQRFSCDVSGVDTESGATCRIKI GS WTH HS RE I S VDPTTE- 
TPQLARWSDGEVLYMPSIRQRFSCDVSGVDTESGATCRIKIGSWTHHSGEISVDPTTE- 
SADRVWSKDGSVIYVPSQRVRFTCDLINVDTEPGATCRIKVGSWTHDNKQFALITGEEG 
ST DRVWS KDGS VI YV PSQR VRFTC DLI NVDTE PGATCRI KVGS WTFDNKQLAL I TGE EG 
:.: . * ,**,*:*:** * *+.**. m **** v ******* . **** ^ .... * 

NSDDSEYFSQYSRFEILDVTQKKNSVTYSCCPEAYEDVEVSLNFRKKGRSEIL 
NSDDSEYFSQYSRFEILDVTQKKNSVIYSCCPEAYEDVEVSLNFRKKGRSEIL 

WNIAEYFDS-PKFDLLSATQSLNRKKYSCCENMYDDIEITFAFRKK 

WNIAEYFDS-PKYDLLSATQSLNRKKYRCCENMYEDIEITFAFRKK 

: :***.. .::::*..*+. * * **.. *.*.*... + + + + 
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Average Hydrophobic^ * -,394 




Average Hydrophobic^ = -,350 



Figure 2 
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L-AChBPJTl LDRADILYNIRQTSRPDVIPT 

L-AChBP_T2 LDRADILYNIRQTSRPDVI PT 

B-AChBPJTl QIRWTLLNQITGES — DVIPL 

B-AChBPJT2 QIRWTLLNQITGES— DVIPL 

hjiAChR_A7 MRCSPGGVWLALAASLLHV SLQGEFQRKLYKELVKNYNPLERPV 

h_5HT3 MLLWVQQALLALLLPTLLAQGEARRSRNTTRPALLRLSDYLLTNYRKGVRPV 

h_GlyR_al -MYSFNTLRLYLSGAIVFFSLAASKEAEAARSATKPMSPSDFLDKLMGRTSGYDARIRPN 

h_GABAA_bl MRKSPGLSDCLWAWILLLSTLTGRSYGQPSLQDELKDNTTVFTRILDRLLDGYDNRLRPG 

★ 

L-AChBP__Tl • QRDR-PVAVSVSLKFINILEVNEITNEVDWFWQQTTWSDRTLA — WNSSHSPDQVSVPI 

L-AChBPJT2 QRDR-PVAVSVSLKFINILEVNEITNEVDWFWQQTTWSDRTLA — WNSSHSPDQVSVPI 

B-AChBP_Tl SNNT-PLNVSLNFKLMNIVEADTEKDQVEWLWTQASWKVPYYSS-LLSSSSLDQVSLPV 

B-AChBP_T2 SNNT-PLNVSLNFKLMNILEADTEKDQVEVVLWTQASWKVPYYSS-LLSSSSLDQVSLPA 

h_nAChR__A7 ANDSQPLTVYFSLSLLQIMDVDEKNQVLTTNIWLQMSWTDHYLQWNVSEYPGVKTVRFPD 

h_5HT3 RDWRKPTTVSIDVIVYAILNVDEKNQVLTTYIWYRQYWTDEFLQWNPEDFDNITKLSIPT 

hJ31yR_al FKGP-PVNVSCNIFINSFGSIAETTMDYRVNIFLRQQWNDPRLAYNEYPDDSLDLDPSML 

h_GABAA__bl LGER-VTEVKTDIFVTSFGPVSDHDMEYTIDVFFRQSWKDERLKF-KGPMTVLRLNNLMA 

L-AChBPJTl SSLWVPDLAAYNAISK-PEVLTPQ — LARWSDGEVLYMPSIRQRFSCDVSG-VDTESGA 

L-AChBP_T2 SSLWVPDLAAYNAISK-PEVLTPQ — LARWSDGEVLYMPSIRQRFSCDVSG-VDTESGA 

B-AChBPJTl SKMWTPDLSFYNAIAA- PELLS AD — RWVSKDGSVIYVPSQRVRFTCDLIN-VpTEPGA 

B-AChBP_T2 SKMWTPDLSFYNAIAA- PELLSTD — RVVVSKDGSVIYVPSQRVRFTCDLIN-VDTEPGA 

h_nAChR_A7 GQIWKPDILLYNSADERFDATFHT — NVLVNSSGHCQYLPPGIFKSSCYIDVRWFPFDVQ 

h_5HT3 DSIWVPDILINEFVDV-GKSPNIP — YVYIRHQGEVQNYKPLQVVTACSLDIYNFPFDVQ 

h_GlyR_al DSIWKPDLFFANEKGAHFHEITTDNKLLRISRNGNVLYSIRITLTLACPMDLKNFPMDVQ 

h_GABAA_bl SKIRTPDTFFHNGKKSVAHNMTMPNKLLRITEDGTLLYTMRLTVRAECPMHLEDFPMDAH 
..:**: . : .* * : 

L-AChBPJTl TCRIKIGSWTHHSREISVDPT TE-NSDDSEYFSQYSRFEILDVTQKKNSVTYSCCPE 

L-AChBP__T2 TCRIKIGSWTHHSGEISVDPT TE-NSDDSEYFSQYSRFEILDVTQKKNSVIYSCCPE 

B-AChBPJTl TCRIKVGSWTHDNKQFALITG EEGWNIAEYFDS-PKFDLLSATQSLNRKKYSCCEN 

B-AChBPJT2 TCRIKVGSWTFDNKQLALITG EEGWNIAEYFDS-PKYDLLSATQSLNRKKYRCCEN 

h__nAChR_A7 HCKLKFGSWS YGGWS LDLQMQ EA DISGYIPN-GEWDLVGIPGICRSERFYECCKE 

h_5HT3 NCSLTFTSWLHTIQDINISLWRLPEKVKSDRSVFMNQ-GEWELLGVLPYFREFSMESS-N 

h_GlyR_al TCIMQLESFGYTMNDLIFEWQ EQGAVQVADGLTL-PQFILKEEKDLRYCTKHYNT-G 

hJ3ABAA_bl ACPLKFGSYAYTRAEVVYEWTR-EPARSVWAEDGSRLNQYDLLG — QTVDSGIVQSSTG 



L-AChBPJTl AYEDVEVSLNFRKKGRSEIL 

L-AChBPJT2 AYEDVEVSLNFRKKGRSEIL 

B-AChBPJTl MYDDIEITFAFRKK 

B-AChBP~T2 M YEDI E I T FAFRKK 

h_nAChR_A7 PYPDVTFTVTMRRRTLYYGL 

h_5HT3 YYAEMKFYVVI RRRPLFYVV 

h_GlyR_al KFTCIEARFHLERQMGYYLI 

h_GABAA_bl EYVVMTTHFHLKRKI GYFVI 



Figure 3 
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nAChR_h-A2 MGPSCPVFLSFTKLSLWWLLLTPAGGEEAKRPPPRAPGDP LSSPSPTALPQGGSHTE 

nAChR_h-A4 MELGGPGAPRLLP P LLLLLGTGLLRASSHVE 

nAChR_h-A5 MAARGSGPRALRLLLLVQLVAGRCGLAGAAGGAQRGLSEPS 

nAChR_h-A3 MALAALAAPGAVAPRLLL LLLSLLP--VAR ASE 

nAChR_h-A6 MLTSKGQGFLHGGLCL WLCVFT P — FFKGCVGCA 

nAChR_h-Al MEPWPLLL LFSLCSAGLVLG SE 

nAChR_h-A7 MRCS PGGVWLALAASL LHVSLQG-EFQR 

nAChR_h- A9 CIS FCWI YFAAS RLR AAETADG- KYAQ 

B-AChBP_Tl MAELRRI ILLLCT IAFHVSHG-QIR 

B-AChBP_T2 MAELRGI ILLLCT IAFHVSHG-QIR 

L-AChBP_Tl MRRNIFCLACL WIVQACLS-LDR 

L-AChBP_T2 MRRNIFCLACL WIVQGCLS-LDR 



nAChR_h-A2 T EDRLFKHLFRGYNRWARPVPNTSDVVIVRFGLSIAQLIDVDEKNQMMTTNVWLKQ 

nAChR_h-A4 TRAHAEERLLKKLFSGYNKWSRPVANISDWLVRFGLSIAQLIDVDEKNQMMTTNVWVKQ 

nAChR_h-A5 SIAKHEDSLLKDLFQDYERWVRPVEHLNDKIKIKFGLAISQLVDVDEKNQLMTTNVWLKQ 

nAChR_h-A3 A EHRLFERLFE D YNEI I RPVAN VS DPVI IH FEVSMSQLVKVDEVNQIMETNLWLKQ 

nAChR_h-A6 T EERLFHKLFSHYNQFIRPVENVSDPVTVHFEVAITQLANVDEVNQIMETNLWLRH 

nAChRJi-Al H- ETRLVAKLFKDYSSVVRPVEDHRQWEVTVGLQLIQLINVDEVNQIVTTNVRLKQ 

nAChRJl-A7 KLYKELVKNYNPLERPVANDSQPLTVYFSLSLLQIMDVDEKNQVLTTNIWLQM 

nAChR_h-A9 KLFNDLFEDYSNALRPVEDTDKVLNVTLQITLSQIKDMDERNQILTAYLWIRQ 
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PTSEVTLOKAL, . . . DAKGEREVKEI VrNK-DAFTEKGQWSIBHKPSRKJJWR . SDDP SYBDVTPYLXXQRXPLFYXV 

DRTBXDLVLKS BVASLOO .i.BPTPS<3BttDIVALPGRRNBN . PODS . T YVDITYDFIiXRRKPLFYTl 

NAHBWNXfQLSA .BE. . GBAVSWraiDPHSpTBMGBWTXRHRPAKKKYN. {fQLTKDOTD . . FQEI IFFLIX0RRPLFyi3 

DANBXiTUDLMT. . DTXDGKDYPIBBiriOPESPTENGBWEIIHKPAKKNXY . PDKPPNGTN . . YQDVTFYLl'lRRKPLFY VI 
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TED D LML YWKK G N DSLKTOERISLSQFJiIQBFHTTTK . . .LAPYSS. . TGWYMRLYINFtfXiRRHIPPPLL 

TDBOLM^HWICWGD KBLKTjDBRISLSQPLXQKFHTTflR . . . tAPYSS . . TGWYNRLYINFT1.RRHI PFFLL 

TRABViVXBWTR. . . BPAR S V WhjB&QS RLNQYDL LG QTVDB G I V 0 S3 TG BY VVMTTHFHXjKRKXGYFVX 
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SEQUENCE LISTING 



<110> 



Stichting voor de Technische Wetenschappen 



<120> 



Water-soluble ligand-binding proteins and analogs of 



ligand-gated ion channels, crystals thereof and their 
use for screening ligands of ligand-gated ion channels 

<130> F 1105 PCT 

<140> 
<141> 

<160> 20 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 690 
<212> DNA 

<213> Lymnaea stagnalis 

<220> 

<221> CDS 

<222> (1) . . (687) 

<220> 

<221> mat_peptide 
<222> (58) . . (687) 

<400> 1 

atg cgt cga aac att ttc tgc ctt get tgt etc tgg ate gtg caa gcg 48 
Met Arg Arg Asn lie Phe Cys Leu Ala Cys Leu Trp lie Val Gin Ala 



-15 



-10 



-5 



tgt eta age ttg gac 
Cys Leu Ser Leu Asp 
-1 1 



egg gca gac ate ttg tac aac ata cgt cag aca 
Arg Ala Asp He Leu Tyr Asn He Arg Gin Thr 
5 10 



96 
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teg aga ccg gat gtg att ccc aca cag cga gat cgc cca gtg gcg'gtg 144 
Ser Arg Pro Asp Val lie Pro Thr Gin Arg Asp Arg Pro Val Ala Val 
15 20 25 

tec gtc tct ttg aag ttc ate aac ate ttg gaa gtg aat gaa ata ace 192 
Ser Val Ser Leu i,ys Phe lie Asn lie Leu Glu Val Asn Glu lie Thr 
30 35 40 45 

aat gaa gtg gac gtg gtc ttt tgg cag cag acg aca tgg teg gac agg 240 
Asn Glu Val Asp Val Val Phe Trp <31n Gin Thr Thr Trp Ser Asp Arg 
50 55 60 



acc etc gee tgg aac agt tct cac tea cca gat cag gtt tec gtg cca 288 
Thr Leu Ala Trp Asn Ser Ser His Ser Pro Asp Gin Val Ser Val Pro 
65 70 75 



ata age tct ttg tgg gtg cct gac etc get gca tac aac gee ate teg 336 
He Ser Ser Leu Trp Val Pro Asp Leu Ala Ala Tyx Asn Ala lie Ser 
80 85 90 

aaa cct gaa gtc ctt aca ccg caa ctg gee agg gtc gta tec gat ggt 384 
Lys Pro Glu Val Leu Thr Pro Gin Leu Ala Arg Val Val Ser Asp Gly 
95 100 105 

gaa gtg ctg tac atg ccg agt ate cgc cag egg ttc tec tgc gat gta 432 
Glu Val Leu Tyr Met Pro Ser He Arg Gin Arg Phe Ser Cys Asp Val 
HO 115 120 125 



teg ggt gtc gat acg gag tec ggt get aca tgt egg ate aaa att ggt 480 
Ser Gly Val Asp Thr Glu Ser Gly Ala Thr Cys Arg He Lys He Gly 
130 135 140 



tec tgg acc cac cac agt aga gag att tct gta gat ccc acg aca gaa 528 
Ser Trp Thr His His Ser Arg Glu He Ser Val Asp Pro Thr Thr Glu 
145 150 155 



aat agt gat gat tct gaa tac ttc tec caa tac tct cgc ttt gaa ate 
Asn Ser Asp Asp Ser Glu Tyr Phe Ser Gin Tyr Ser Arg Phe Glu He 
160 165 170 



576 
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ttg gac gtc aca cag aag aag aac teg gtt acc tac tct tgc tgt ccg 624 
Leu Asp Val Thr Gin Lys Lys Asn Ser Val Thr Tyr Ser Cys Cys Pro 
175 180 185 

gag gca tac gag gac gtt gaa gtg agt etc aat ttc egg aag aag gga 672 
Glu Ala Tyr Glu Asp Val Glu Val Ser Leu Asn Phe Arg Lys Lys Gly 
190 195 200 205 

cgc tec gaa att ctt tag 690 
Arg Ser Glu lie Leu 
210 



<210> 2 
<211> 229 
<212> PRT 

<213> Lymnaea stagnalis 
<400> 2 

Met Arg Arg Asn He Phe Cys Leu Ala Cys Leu Trp He Val Gin Ala 
-15 -10 -5 

Cys Leu Ser Leu Asp Arg Ala Asp lie Leu Tyr Asn He Arg Gin Thr 
-1 1 5 10 

Ser Arg Pro Asp Val He Pro Thr Gin Arg Asp Arg Pro Val Ala Val 
15 20 25 

Ser Val Ser Leu Lys Phe He Asn He Leu Glu Val Asn Glu He Thr 
30 35 40 45 

Asn Glu Val Asp Val Val Phe Trp Gin Gin Thr Thr Trp Ser Asp Arg 
50 55 60 

Thr Leu Ala Trp Asn Ser Ser His Ser Pro Asp Gin Val Ser Val Pro 
65 70 75 

He Ser Ser Leu Trp Val Pro Asp Leu Ala Ala Tyr Asn Ala lie Ser 
80 85 90 



J 



(4/20) 



Lys Pro Glu Val Leu Thr Pro Gin Leu Ala Arg Val Val Ser Asp Gly 
95 100 105 

Glu Val Leu Tyr Met Pro Ser lie Arg Gin Arg Phe Ser Cys Asp Val 
HO 115 120 125 

Ser Gly Val Asp Thr Glu Ser Gly Ala Thr Cys Arg He Lys lie Gly 
130 135 140 

Ser Trp Thr His His Ser Arg Glu He Ser Val Asp Pro Thr Thr Glu 
145 150 155 

Asn Ser Asp Asp Ser Glu Tyr Phe Ser Gin Tyr Ser Arg Phe Glu He 
160 165 170 

Leu Asp Val Thr Gin Lys Lys Asn Ser Val Thr Tyr Ser Cys Cys Pro 
175 180 185 

Glu Ala Tyr Glu Asp Val Glu Val Ser Leu Asn Phe Arg Lys Lys Gly 
190 195 200 205 

Arg Ser Glu He Leu 
210 



<210> 3 
<211> 690 
<212> DNA 

<213> Lymnaea stagnalis 

<220> 

<221> CDS 

<222> (1) . . (687) 

<220> 

<221> mat__peptide 
<222> (58).. (687) 



<400> 3 
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atg cgt cga aac att ttc tgc ctt get tgt etc tgg ate gtg caa ggg 48 
Met Arg Arg Asn lie Phe Cys Leu Ala Cys Leu Trp He Val Gin Gly 
-15 -10 -5 

tgt eta age ttg gac egg gca gac ate ttg tac aac ata cgt cag aca 96 
Cys Leu Ser Leu Asp Arg Ala Asp He Leu Tyr Asn He Arg Gin Thr 
-11 5 m 10 

teg aga ccg gat gtg att ccc aca cag cga gat cgc cca gtg gcg gtg 144 
Ser Arg Pro Asp Val He Pro Thr Gin Arg Asp Arg Pro Val Ala Val 
15 20 25 

tec gtc tct ttg aag ttc ate aac ate ttg gaa gtg aat gaa ata acc 192 
Ser Val Ser Leu Lys Phe He Asn He Leu Glu Val Asn Glu He Thr 
30 35 40 45 

aat gaa gtg gac gtg gtc ttt tgg cag cag acg aca tgg teg gac agg 240 
Asn Glu Val Asp Val Val Phe Trp Gin Gin Thr Thr Trp Ser Asp Arg 
50 55 60 

acc etc gee tgg aac agt tct cac tea cca gat cag gtt tec gtg cca 288 
Thr Leu Ala Trp Asn Ser Ser His Ser Pro Asp Gin Val Ser Val Pro 
65 70 75 

ata age tct ttg tgg gtg cct gac etc get gca tac aac gec ate teg 336 
lie Ser Ser Leu Trp Val Pro Asp Leu Ala Ala Tyr Asn Ala He Ser 
80 85 90 

aaa cct gaa gtc ctt aca ccg caa ctg gee agg gtc gta tec gat ggt 3 84 
Lys Pro Glu Val Leu Thr Pro Gin Leu Ala Arg Val Val Ser Asp Gly 
95 100 105 

gaa gtg ctg tac atg ccg agt ate cgc cag egg ttc tec tgc gat gta 432 
Glu Val Leu Tyr Met Pro Ser He Arg Gin Arg Phe Ser Cys Asp Val 
110 115 120 125 

teg ggt gtc gat acg gag tec ggt get acg tgt egg ate aaa att ggt 480 
Ser Gly Val Asp Thr Glu Ser Gly Ala Thr Cys Arg He Lys He Gly 
130 135 140 
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(6/20) 

tec tgg acc cac cac agt gga gag att tct gta gat ccc acg aca gaa 528 

Ser Trp Thr His His Ser Gly Glu He Ser Val Asp Pro Thr Thr Glu 

145 150 155 

aat agt gat gat tct gaa tac ttc tec caa tac tct cgc ttt gaa ate 576 
Asn Ser Asp Asp Ser Glu Tyr Phe Ser Gin Tyr Ser Arg Phe Glu He 
160 165 170 

ttg gac gtc aca cag aag aag aac teg gtt ate tac tct tgc tgt ccg 624 
Leu Asp Val Thr Gin Lys Lys Asn Ser Val He Tyr Ser Cys Cys Pro 
175 180 185 

gag gca tac gag gac gtt gaa gtg agt etc aat ttc egg aag aag gga 672 
Glu Ala Tyr Glu Asp Val Glu Val Ser Leu Asn Phe Arg Lys Lys Gly 
19 0 195 200 205 

cgc tec gaa att ctt tag 590 
Arg Ser Glu He Leu 
210 



<210> 4 
<211> 229 
<212> PRT 

<213> Lymnaea stagnalis 
<400> 4 

Met Arg Arg Asn He Phe Cys Leu Ala Cys Leu Trp He Val Gin Gly 
-15 -10 -5 

Cys Leu Ser Leu Asp Arg Ala Asp He Leu Tyr Asn He Arg Gin Thr 
-11 5 10 

Ser Arg Pro Asp Val He Pro Thr Gin Arg Asp Arg Pro Val Ala Val 
15 20 25 

Ser Val Ser Leu Lys Phe He Asn He Leu Glu Veil Asn Glu He Thr 
30 35 40 45 

Asn Glu Val Asp Val Val Phe Trp Gin Gin Thr Thr Trp Ser Asp Arg 



(7/20) 



50 



55 



60 



Thr Leu Ala Trp Asn Ser Ser His Ser Pro Asp Gin Val Ser Val Pro 
65 70 75 

He Ser Ser Leu Trp Val Pro Asp Leu Ala Ala Tyr Asn Ala He Ser 
80 85 90 

Lys Pro Glu Val Leu Thr Pro Gin Leu Ala Arg Val Val Ser Asp Gly 
95 100 105 

Glu Val Leu Tyr Met Pro Ser He Arg Gin Arg Phe Ser Cys Asp Val 
110 115 120 125 

Ser Gly Val Asp Thr Glu Ser Gly Ala Thr Cys Arg He Lys He Gly 
130 135 140 

Ser Trp Thr His His Ser Gly Glu He Ser Val Asp Pro Thr Thr Glu 
145 150 155 

Asn Ser Asp Asp Ser Glu Tyr Phe Ser Gin Tyr Ser Arg Phe Glu He 
160 165 170 

Leu Asp Val Thr Gin Lys Lys Asn Ser Val He Tyr Ser Cys Cys Pro 
175 180 185 

Glu Ala Tyr Glu Asp Val Glu Val Ser Leu Asn Phe Arg Lys Lys Gly 
190 195 200 205 



Arg Ser Glu He Leu 
210 



<210> 5 
<211> 675 
<212> DNA 

<213> Bulinus truncatus 



<220> 
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<221> CDS 

<222> (1) . . (672) 



<220> 

<221> mat_peptide 
<222> (64) . . (672) 

<400> 5 

atg get gaa eta cga agg ate att ctt ctg eta tgt act att gee ttt 48 

Met Ala Glu Leu Arg Arg lie lie Leu Leu Leu Cys Thr lie Ala Phe 

-20 -15 -10 

cat gtt tec cat gga caa ata aga tgg acg ctg ctg aat cag ate acc 96 
His Val Ser His Gly Gin He Arg Trp Thr Leu Leu Asn Gin He Thr 
-5 -11 5 10 

ggt gaa tct gac gtc att ccg ctg tct aac aac acg ccc ctg aat gtg 144 
Gly Glu Ser Asp Val He Pro Leu Ser Asn Asn Thr Pro Leu Asn Val 
15 20 25 

teg ctg aat ttt aag ctg atg aat ate gta gag gcg gac aca gaa aaa 192 
Ser Leu Asn Phe Lys Leu Met Asn He Val Glu Ala Asp Thr Glu Lys 
30 35 40 



gat caa gtg gag gtc gtg ctg tgg aca cag get age tgg aaa gtg ccg 240 
Asp Gin Val Glu Val Val Leu Trp Thr Gin Ala Ser Trp Lys Val Pro 
45 50 55 

tat tac age tea ctg ctg tec tct age agt tta gac cag gtg age tta 288 
Tyr Tyr Ser Ser Leu Leu Ser Ser Ser Ser Leu Asp Gin Val Ser Leu 
60 65 70 75 



cca gtc age aaa atg tgg acc cca gac ctt tct ttc tac aac gee ate 33 6 
Pro Val Ser Lys Met Trp Thr Pro Asp Leu Ser Phe Tyr Asn Ala He 
80 85 90 



get gca ccc gag ttg 
Ala Ala Pro Glu Leu 
95 



etc tec gca gac cgc gtg gtg gtc tct aag gac 
Leu Ser Ala Asp Arg Val Val Val Ser Lys Asp 
100 105 



384 
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ggg age gtc att tac gtc ccc age cag agg gtc cgt ttc acc tgc gac 432 
Gly Ser Val He Tyr Val Pro Ser Gin Arg Val Arg Phe Thr Cys Asp 
HO 115 120 

ctt att aat gtc gac acg gag ccg gga gec acc tgt cgc ate aaa gtc 480 
Leu He Asn Val Asp Thr Glu Pro Gly Ala Thr Cys Arg He Lys Val 
125 130 135 

gga tec tgg acc cac gac aac aaa cag ttc gec ctg ate acc ggg gag 528 
Gly Ser Trp Thr His Asp Asn Lys Gin Phe Ala Leu He Thr Gly Glu 
140 145 150 155 

gag ggg gtg gtg aat att gca gag tac ttc gac age cca aag ttt gac 576 
Glu Gly Val Val Asn He Ala Glu Tyr Phe Asp Ser Pro Lys Phe Asp 
160 165 170 



ctt ttg agt gee aca cag agt ctg aat cgc aag aag tac age tgt tgc 624 
Leu Leu Ser Ala Thr Gin Ser Leu Asn Arg Lys Lys Tyr Ser Cys Cys 
175 180 185 

gag aat atg tat gat gac att gaa att acc ttt gca ttc aga aag aag 672 
Glu Asn Met Tyr Asp Asp He Glu He Thr Phe Ala Phe Arg Lys Lys 
190 195 200 

taa 6 75 

<210> 6 
<211> 224 
<212> PRT 

<213> Bulinus truncatus 



<400> 6 

Met Ala Glu Leu Arg Arg He He Leu Leu Leu Cys Thr He Ala Phe 
-20 -15 -10 

His Val Ser His Gly Gin He Arg Trp Thr Leu Leu Asn Gin He Thr 
-5 -11 5 10 

Gly Glu Ser Asp Val He Pro Leu Ser Asn Asn Thr Pro Leu Asn Val 



(10/20) 

15 20 25 

Ser Leu Asn Phe Lys Leu Met Asn He Val Glu Ala Asp Thr Glu Lys 
30 35 40 

Asp Gin Val Glu Val Val Leu Trp Thr Gin Ala Ser Trp Lys Val Pro 
45 50 55 

Tyr Tyr Ser Ser Leu Leu Ser Ser Ser Ser Leu Asp Gin Val Ser Leu 
60 65 70 75 



Pro Val Ser Lys Met Trp Thr Pro Asp Leu Ser Phe Tyr Asn Ala He 
80 85 90 

Ala Ala Pro Glu Leu Leu Ser Ala Asp Arg Val Val Val Ser Lys Asp 
95 100 105 

Gly Ser Val He Tyr Val Pro Ser Gin Arg Val Arg Phe Thr Cys Asp 
HO 115 120 

Leu He Asn Val Asp Thr Glu Pro Gly Ala Thr Cys Arg He Lys Val 
125 130 135 



Gly Ser Trp Thr His Asp Asn Lys Gin Phe Ala Leu He Thr Gly Glu 
140 145 150 155 

Glu Gly Val Val Asn He Ala Glu Tyr Phe Asp Ser Pro Lys Phe Asp 
160 165 170 

Leu Leu Ser Ala Thr Gin Ser Leu Asn Arg Lys Lys Tyr Ser Cys Cys 
175 180 185 

Glu Asn Met Tyr Asp Asp He Glu He Thr Phe Ala Phe Arg Lys Lys 
190 195 200 



<210> 7 
<211> 675 
<212> DNA 
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<213> Bulinus truncatus 

<220> 

<221> CDS 

<222> (1) . . (672) 

<220> 

<221> mat_peptide 
<222> (64) . . (672) 



<400> 7 

atg get gaa eta cga 
Met Ala Glu Leu Arg 
-20 



ggg ate att ctt ctg eta 
Gly He He Leu Leu Leu 
-15 



tgt act att gee ttt 48 
Cys Thr He Ala Phe 
^10 



cat gtt tec cat gga caa ata aga tgg acg ctg ctg aat cag ate acc 96 
His Val Ser His Gly Gin He Arg Trp Thr Leu Leu Asn Gin He Thr 
-5 -11 5 10 

ggt gaa tct gac gtc att ccg ctg tct aac aac acg cca ctg aat gtg 144 
Gly Glu Ser Asp Val He Pro Leu Ser Asn Asn Thr Pro Leu Asn Val 
15 20 25 



teg ctg aat ttt aag ctg atg aat ate tta gag gcg gac aca gag aaa 192 
Ser Leu Asn Phe Lys Leu Met Asn He Leu Glu Ala Asp Thr Glu Lys 
30 35 40 

gat caa gtg gag gtc gtg ctg tgg aca cag get age tgg aaa gtg ccg 240 
Asp Gin Val Glu Val Val Leu Trp Thr Gin Ala Ser Trp Lys Val Pro 
45 . 50 55 



tat tac age tea ctg ctg tec tct age agt tta gac cag gtg age tta 288 
Tyr Tyr Ser Ser Leu Leu Ser Ser Ser Ser Leu Asp Gin Val Ser Leu 
60 65 70 75 

cca gee age aaa atg tgg acc cca gac ctt tct ttc tat aac gee ate 33 6 
Pro Ala Ser Lys Met Trp Thr Pro Asp Leu Ser Phe Tyr Asn Ala He 
80 85 90 



get gca ccc gag ttg etc tec aca gac cgc gtg gtg gtc tct aag gac 



384 
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Ala Ala Pro Glu Leu Leu Ser Thr Asp Arg Val Val Val Ser Lys Asp 
95 100 105 

ggg age gtc att tac gtg ccc age cag agg gtc cgt ttc acc tgc gac 432 
Gly Ser Val lie Tyr Val Pro Ser Gin Arg Val Arg Phe Thr Cys Asp 
110 115 120 

ctt att aat gtg gac acg gag ccg gga gec acc tgt cgc ate aaa gtc 480 
Leu He Asn Val Asp Thr Glu Pro Gly Ala Thr Cys Arg He Lys Val 
125 130 135 

gga tec tgg acc ttc gac aac aaa cag etc gee ctg ate acc ggg gag 528 
Gly Ser Trp Thr Phe Asp Asn Lys Gin Leu Ala Leu He Thr Gly Glu 
140 145 150 155 

gag ggg gtg gtg aat att gca gag tac ttc gac age cca aag tac gac 576 
Glu Gly Val Val Asn He Ala Glu Tyr Phe Asp Ser Pro Lys Tyr Asp 
160 165 170 

ctt ttg agt gee aca cag agt ctg aat cgc aag aag tac aga tgt tgc 624 
Leu Leu Ser Ala Thr Gin Ser Leu Asn Arg Lys Lys Tyr Arg Cys Cys 
175 180 185 

gag aat atg tat gaa gac att gaa att acc ttt gca ttc aga aag aag 672 
Glu Asn Met Tyr Glu Asp He Glu He Thr Phe Ala Phe Arg Lys Lys 
190 195 200 

taa 675 



<210> 8 
<211> 224 
<212> PRT 

<213> Bulinus truncatus 
<400> 8 

Met Ala Glu Leu Arg Gly He He Leu Leu Leu Cys Thr He Ala Phe 
-20 -15 -10 

His Val Ser His Gly Gin He Arg Trp Thr Leu Leu Asn Gin He Thr 



(13/20) 

"5 -11 5 10 

Gly Glu Ser Asp Val He Pro Leu Ser Asn Asn Thr Pro Leu Asn Val 
15 20 25 

Ser Leu Asn Phe Lys Leu Met Asn lie Leu Glu Ala Asp Thr Glu Lys 
30 35 40 

Asp Gin Val Glu Val Val Leu Trp Thr Gin Ala Ser Trp Lys Val Pro 
45 50 55 



Tyr Tyr Ser Ser Leu Leu Ser Ser Ser Ser Leu Asp Gin Val Ser Leu 
6 0 65 70 75 

Pro Ala Ser Lys Met Trp Thr Pro Asp Leu Ser Phe Tyr Asn Ala He 
80 85 90 

Ala Ala Pro Glu Leu Leu Ser Thr Asp Arg Val Val Val Ser Lys Asp 
95 100 105 

Gly Ser Val He Tyr Val Pro Ser Gin Arg Val Arg Phe Thr Cys Asp 
HO 115 120 



Leu He Asn Val Asp Thr Glu Pro Gly Ala Thr Cys Arg He Lys Val 
125 130 135 

Gly Ser Trp Thr Phe Asp Asn Lys Gin Leu Ala Leu He Thr Gly Glu 
140 145 150 155 

Glu Gly Val Val Asn He Ala Glu Tyr Phe Asp Ser Pro Lys Tyr Asp 
160 165 170 

Leu Leu Ser Ala Thr Gin Ser Leu Asn Arg Lys Lys Tyr Arg Cys Cys 
175 180 185 



Glu Asn Met Tyr Glu Asp He Glu He Thr Phe Ala Phe Arg Lys Lys 
190 195 • 200 



(14/20) 



<210> 9 

<211> 502 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> DOMAIN 
<222> (1) . . (235) 

<400> 9 

Met Arg Cys Ser Pro Gly Gly Val Trp Leu Ala Leu Ala Ala Ser Leu 
15 10 15 

Leu His Val Ser Leu Gin Gly Glu Phe Gin Arg Lys Leu Tyr Lys Glu 
20 25 30 

Leu Val Lys Asn Tyr Asn Pro Leu Glu Arg Pro Val Ala Asn Asp Ser 
35 40 45 

Gin Pro Leu Thr Val Tyr Phe Ser Leu Ser Leu Leu Gin lie Met Asp 
50 55 60 

Val Asp Glu Lys Asn Gin Val Leu Thr Thr Asn lie Trp Leu Gin Met 
65 70 75 80 

Ser Trp Thr Asp His Tyr Leu Gin Trp Asn Val Ser Glu Tyr Pro Gly 
85. 90 95 

Val Lys Thr Val Arg Phe Pro Asp Gly Gin He Trp Lys Pro Asp He 
100 105 no 

Leu Leu Tyr Asn Ser Ala Asp Glu Arg Phe Asp Ala Thr Phe His Thr 
115 120 125 

Asn Val Leu Val Asn Ser Ser Gly His Cys Gin Tyr Leu Pro Pro Gly 
130 135 140 



He Phe Lys Ser Ser Cys Tyr He Asp Val Arg Trp Phe Pro Phe Asp 
145 150 155 160 



(15/20) 



Val Gin His Cys Lys Leu Lys Phe Gly Ser Trp Ser Tyr Gly Gly Trp 
165 170 175 

Ser Leu Asp Leu Gin Met Gin Glu Ala Asp He Ser Gly Tyr He Pro 
180 185 190 

Asn Gly Glu Trp Asp Leu Val Gly He Pro Gly Lys Arg Ser Glu Arg 
195 200 205 

Phe Tyr Glu Cys Cys Lys Glu Pro Tyr Pro Asp Val Thr Phe Thr Val 
210 215 220 

Thr Met Arg Arg Arg Thr Leu Tyr Tyr Gly Leu Asn Leu Leu He Pro 
225 230 235 240 

Cys Val Leu He Ser Ala Leu Ala Leu Leu Val Phe Leu Leu Pro Ala 
245 250 255 

Asp Ser Gly Glu Lys He Ser Leu Gly He Thr Val Leu Leu Ser Leu 
260 265 270 



Thr Val Phe Met Leu Leu Val Ala Glu lie Met Pro Ala Thr Ser Asp 
275 280 285 



Ser Val Pro Leu He Ala Gin Tyr Phe Ala Ser Thr Met He He Val 
290 295 300 

Gly Leu Ser Val Val Val Thr Val He Val Leu Gin Tyr His His His 
305 310 315 320 

Asp Pro Asp Gly Gly Lys Met Pro Lys Trp Thr Arg Val He Leu Leu 
325 330 335 

Asn Trp Cys Ala Trp Phe Leu Arg Met Lys Arg Pro Gly Glu Asp Lys 
340 345 350 

Val Arg Pro Ala Cys Gin His Lys Gin Arg Arg Cys Ser Leu Ala Ser 
355 360 365 

Val Glu Met Ser Ala Val Ala Pro Pro Pro Ala Ser Asn Gly Asn Leu 



(16/20) 



370 375 380 

Leu Tyr He Gly Phe Arg Gly Leu Asp Gly Val His Cys Val Pro Thr 
385 390 395 400 

Pro Asp Ser Gly Val Val Cys Gly Arg Met Ala Cys Ser Pro Thr His 
405 410' 415 

Asp Glu His Leu Leu His Gly Gly Gin Pro Pro Glu Gly Asp Pro Asp 
420 425 430 

Leu Ala Lys lie Leu Glu Glu Val Arg Tyr He Ala Asn Arg Phe Arg 
435 440 445 

Cys Gin Asp Glu Ser Glu Ala Val Cys Ser Glu Trp Lys Phe Ala Ala 
450 455 460 

Cys Val Val Asp Arg Leu Cys Leu Met Ala Phe Ser Val Phe Thr He 
465 470 475 480 

He Cys Thr He Gly He Leu Met Ser Ala Pro Asn Phe Val Glu Ala 
485 490 495 

Val Ser Lys Asp Phe Ala 
500 



<210> 10 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: N-terminus of 
mature LAChBPl 

<400> 10 

Leu Asp Arg Ala Asp He Leu Tyr Asn He 
15 10 
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<210> 11 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

Oligonucleotides encoding N- terminal peptide of 
IiAChBPl 

<220> 

<221> modif ied_base 
<222> (13) 
<223> i 

<400> 11 

cggatccgay mgagcngaya thytntayaa ya 

<210> 12 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primerl useful 
for cloning cDNA encoding LAChBP (optionally with 
Primer 2) 

<220> 

<221> modified_base 
<222> (14) 
<223> i 

<220> 

<221> modif ied_base 
<222> (20) 
<223> i 
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<400> 12 

gcgaattcga yacagarwsa ggngcnacnt g 31 

<210> 13 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer2 
useful for cloning cDNA encoding LAChBP 
(optionally with Primerl) 

<220> 

<221> modified_base 
<222> (20) 
<223> i 

<400> 13 

gcgaagcttc rtcytcrtaa gcytcngcrc arc 33 

<210> 14 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: His-tag 
<400> 14 

Ser Arg Gly His His His His His His 
1 5 

<210> 15 
<2U> 14 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: His-tag 
<400> 15 

Glu Phe Lys Asp Asp Asp Asp Lys His His His His His His 
15 10 

<210> 16 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Additonal 

amino acids at the N- terminus of mature LAChBP due 
to alpha-mating factor cleavage site 

<400> 16 
Glu Ala Glu Ala 
1 

<210> 17 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer useful 
for generating LAChBP/ alpha 7 nAChR chimera 

<400> 17 

gcgctcgaga aaagagaggc tgaagctttg gaccgggcag acatctt 

<210> 18 
<211> 30 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer useful 
for generating LAChBP/alpha7 nACHR chimera 

<400> 18 

cgcgaattca agaatttcgg agcgtccctt 

<210> 19 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer useful 
for generating LAChBP/alpha7 nACHR chimera 

<400> 19 

gtggaaacca gacattctcc tctacaacgc catctcgaaa cc 

<210> 20 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer useful 
for generating LAChBP/alpha7 nACHR chimera 

<400> 20 

gaggagaatg tctggtttcc acaaagagct tattggcac 
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